RECOMBINANT ADENOVIRAL VECTORS FOR CELL SPECIFIC
INFECTION AND GENOME INTEGRATION AND EXPRESSING CHIMERIC
FIBER PROTEINS

This invention was made, at least in part, with funding from the National Institutes of 
Health (Grant Nos. R01 CA 80192-01 and R21 DK 55590-01). Accordingly, the United 
States Government has certain rights in this invention.

FIELD OF THE INVENTION 

This invention relates to the field of gene therapy, and in particular, to novel adenovirus 
(Ad) vectors that selectively infect cells for gene therapy, and to Ad vectors containing
modifications of the fiber protein to allow retargeting of any adenovirus serotype. 

BACKGROUND OF THE INVENTION 

Gene transfer vectors require the efficient transduction of target cells, stable association
with the host genome, and adequate transgene expression in the appropriate target cell,
without associated toxic or immunological side effects. Currently available viral vector
systems, including recombinant retroviruses, adenoviruses and adeno-associated viruses,
are not suitable for efficient gene transfer into many cell types. Retroviral vectors require
cell division for stable integration. Recombinant adenoviruses are not able to infect many
cell types important for gene therapy, including hematopoietic stem cells, monocytes, T-
and B-lymphocytes. Moreover, recombinant adeno-associated virus (AAV) vectors integrate
with a low frequency.

First generation adenoviruses have a number of properties that make them an attractive
vehicle for gene transfer (Hitt, M.M. et al. 1997 Advances in Pharmacology 40:137-205).
These include the ability to produce purified virus at high titers in concert with highly
efficient gene transfer of up to 8 kb long expression cassettes into a large variety of cell
types in vivo, including non-dividing cells. Limitations of first generation adenoviruses
include the development of immune responses to expressed viral proteins resulting in
toxicity and virus clearance. The episomal status of adenoviral DNA within transduced
cells is another limitation of first generation Ad vectors. Stable integration of adenovirus
DNA into the host genome is reported only for wild-type forms of specific subtypes and
appears not to occur in a detectable manner with E1/E3-deleted Ad5 (adenovirus
serotype 5) vectors widely used for gene transfer in vitro and in vivo (Hitt, M.M. et al.
1997 Advances in Pharmacology 40:137-205).

Recombinant AAV vectors (rAAV) integrate with a low frequency (about 1 out of 20,000
genomes) randomly as concatemers into the host genome (Rutledge, E.A.; Russell, D.W.
1997 J. Virology, 71, 8429-8436). The presence of two AAV inverted terminal repeats
(ITRs) and as yet unknown host cellular factors seem to be the only requirement for
vector integration (Xiao, X., et al. 1997, J. Virology, 71, 941-948; Balague, C., et al.
1997, J. Virology, 71, 3299-3306; Yang, C.C. 1997, J. Virology, 71, 9231-9247). In the
presence of the large AAV Rep proteins, AAV integrates preferentially into a specific site
on human chromosome 19, called AAVS1 (Berns, K.I., 1996. Fields Virology, Fields,
B.N. et al. (ed) Vol. 2, Lippincott-Raven, Philadelphia, PA, 2173-2220). The AAV
capsid is formed by three coat proteins (VP1-3), which interact with specific heparan
sulfates on the cell surface and probably with specific receptor(s). However, many cell
types, including hematopoietic stem cells, lack these structures, so that rAAV vectors
based on AAV2 cannot infect or transduce these cells (Malik, P. et al., 1997, J. Virology,
71, 1776-1783; Qing, K.Y., et al. 1998, J. Virology, 72, 1593-1599). Other
disadvantages of rAAV vectors include the limited insert size (4.5-5kb) that can be
accommodated in rAAV vectors lacking all viral genes and the low transducing titers of
rAAV preparations.

Adenovirus infection is initiated by attachment of Ad5 to the cell surface via its fiber
protein (for a review, see Shenk, T. 1996 Fields Virology, Fields, B.N. et al. (ed)
Vol. 2, Lippincott-Raven, Philadelphia, PA, 2111-2148). The distal, C-terminal domain
of the trimeric fiber molecule terminates in a knob, which binds to a specific cellular
receptor identified recently as the coxsackie-adenovirus receptor (CAR) (Bergelson, J.M.
et al. 1997, Science, 275, 1320-1323). After binding, in an event independent of virus
attachment, Arg-Gly-Asp (RGD) motifs in the penton base interact with cellular integrins
of the αvβ3 and αvβ5 types. This interaction triggers cellular internalization whereby the
virion achieves localization within the endosome. The endosomal membrane is lysed in a
process mediated by the penton base, releasing the contents of the endosome to the
cytoplasm. During these processes, the virion is gradually uncoated and the adenoviral
DNA is transported to the nucleus where replication takes place. The terminal protein,
which is covalently attached to the viral genome, and the core protein V, which is localized
on the surface of the cores, have nuclear localization signals (NLSs) (van der Vliet, B.
1995, The Molecular Repertoire of Adenoviruses, Vol. 2, Doerfler, W. and Boehm, P. (ed.),
Springer Verlag, Berlin, 1-31). These NLSs play a crucial role in directing the adenoviral
genome to the nucleus and probably represent the structural elements which allow
adenovirus to transduce non-dividing cells. When the double-stranded, linear DNA
reaches the nucleus, it binds to the nuclear matrix through its terminal protein.

Since the cell types that can be infected with Ad5 or Ad2 vectors are restricted by the
presence of CAR and specific integrins, attempts have been made to widen the tropism of
Ad vectors. Genetic modification of adenovirus coat proteins to target novel cell surface
receptors has been reported for the fiber (Krasnykh, V. et al. 1998 J. Virology, 72,
1844-1852; Krasnykh, V. et al. 1996 J. Virology, 70, 6839-6846; Stevenson, S.D., et al. 1997,
J. Virology, 71, 4782-4790), penton base (Wickham, T.J., et al. 1996, J. Virology, 70,
6831-6838; Wickham, T.J., et al. 1995, Gene Therapy, 69, 750-756), and hexon proteins
(Crompton, J., et al. 1994, J. Gen. Virol., 75, 133-139). The most promising modification
seems to be the functional modification of the fiber protein or, more specifically, of the
fiber knob, the moiety that mediates the primary attachment. Two groups have
reported the generation of fibers consisting of the Ad5 tail/shaft and the knob domain of
Ad3 (Krasnykh, V. et al. 1996 supra; Stevenson, S.D., et al. 1997, supra). Recently,
recombinant adenoviruses with fibers containing C-terminal poly-lysine, gastrin-releasing
peptide, somatostatin, E-selectin-binding peptide, or oligo-histidines were produced in
order to change the native tropism of Ad5. Krasnykh et al. found (Krasnykh, V. et al.
1998 supra) that heterologous peptide ligands could be inserted into the HI loop of the
fiber knob domain without affecting the biological function of the fiber. Based on studies
with other Ad serotypes, it appears that the length of the fiber shaft is a critical element,
determining the efficiency of interaction with cell surface integrins and the internalization
process. Thus far, there are no reported data demonstrating successful retargeting of Ad5
vectors for a specific cell type.

Therefore, there is a present need for an improved adenovirus vector which can be
targeted efficiently to a variety of cell types and tissues and remain stably integrated in
the host genome with minimal antigenicity to the host. The present invention discloses
novel chimeric adenoviral (Ad) vectors, designated Ad-AAV vectors, which express a modified
fiber protein on their capsid for specifically targeting the vector. Methods of making, uses
and advantages of these vectors are described. In addition, the alterations described for the
knob and shaft domains of the fiber protein provide a novel approach to retarget any
adenovirus serotype for cell specific infection.

SUMMARY OF THE INVENTION 

The present invention provides for novel chimeric Ad vectors carrying transgenes, or
portions of transgenes, for stable and efficient gene transfer into diverse cell types or
tissues in a CAR- and/or αvβ3/5-independent manner. Also provided are methods for
producing such vectors and the use thereof for gene therapy to target a specific cell type
or tissue.

The recombinant adenovirus vectors of the invention (Example I) provide a novel design
that allows for the easy production and delivery of a "gutless" adenoviral vector with the
added advantage of stable integration of the transgene into the host genome of different
cell types. The adenoviral vector described is devoid of all adenoviral sequences except
for the 5' and 3' cis elements necessary for replication and virion encapsidation. The
adeno-associated virus (AAV) sequences of the invention, comprising the 5' (right) and 3'
(left) inverted terminal repeats (ITRs), flank the transgene cassette such that they
direct homologous recombination during viral replication and viral integration into the
host genome. In one embodiment AAV-ITR flanking sequences are used. The vector
also contains a selected transgene(s) operably linked to a selected regulatory element and
a polyadenylation stop signal, which is in turn flanked by the flanking sequences described
above. The selected transgene(s) can be linked under the same regulatory elements or
under separate regulatory elements in the same orientation or in opposite orientations
with respect to each other. The selected transgene(s) are any gene or genes which are
expressed in a host cell or tissue for therapeutic, reporter or selection purposes. This
vector is characterized by high titer transgene delivery to a host cell and the ability to
stably integrate the transgene into the host genome. Also provided is a method to
improve the integration frequency and site specific integration by incorporating an AAV
Rep protein into the recombinant hybrid vector.

The invention also provides chimeric fiber proteins (Example II), which include
naturally occurring fiber proteins in which a portion or portions of the sequence are
modified to alter cell or tissue specificity of infection. Altered fiber protein sequences
can include fiber protein domains (the knob domain, the shaft domain, and the tail
domain) from other or the same adenovirus serotypes or from randomly selected
peptides. A chimeric fiber protein can be entirely composed of non-naturally occurring
sequences. The invention further relates to nucleic acid sequences encoding the chimeric
fiber proteins. These nucleic acid sequences can be naturally occurring, a mixture of
naturally occurring and non-naturally occurring sequences, or entirely non-naturally
occurring sequences.

The heterologous fiber protein sequences described herein can be inserted into any
adenovirus based vector which contains a capsid, rendering the virus capable of
specifically infecting a given cell or tissue. Adenoviral vectors having such a
heterologous fiber sequence can be used to direct gene transfer into desired cells. For
stable integration of the transgene cassette into the host genome, the chimeric Ad-AAV
vector described in the invention is the preferred vector of use.

The invention also includes a library of adenoviruses displaying random peptides in their
fiber knobs, which can be used as ligands to screen for an adenovirus variant with tropism
to a particular cell type in vitro and in vivo.

The chimeric Ad vectors described herein include the Ad.AAV genome with a modified
fiber protein expressed on its capsid. These chimeric vectors are designed to infect a
wide variety of cells, in particular, the cells which can only be poorly transduced by the
commonly used retroviral, AAV and adenoviral vectors. These cells include, but are not
limited to, hematopoietic stem cells, lung epithelial cells, dendritic cells, lymphoblastoid
cells, and endothelial cells. Hematopoietic stem cells such as CD34+ cells can be targeted
for gene therapy of sickle cell anemia and thalassemia using the vector described herein.
The chimeric Ad-AAV vector capable of transducing genes into endothelial cells can be
used in gene therapy for vascular diseases such as atherosclerosis or restenosis after
coronary artery surgery.



BRIEF DESCRIPTION OF THE FIGURES 

Figs. 1A-1C display a proposed mechanism for formation of the ΔAd.AAV1 genome.

Figs. 2A and 2B show electron photomicrographs of hybrid virus particles: Fig. 2A
shows Ad.AAV1 and Fig. 2B shows ΔAd.AAV1.

Fig. 3 illustrates analysis of ΔAd.AAV1 genomes after transduction of SKHep1 cells by
pulsed-field gel electrophoresis (PFGE). 1x10^6 control SKHep1 cells (SKHep1) (lanes 1-3,
5, 9), SKHep1 cells from G418 resistant pools (ΔAd.AAV1) (infected with ΔAd.AAV1
and selected for 4 weeks) (lanes 6-8, 10-12), or SKHep1 cells collected at 3 days after
infection with 2000 genomes Ad.AAV1 (Ad) (lanes 4, 13) are sealed in agarose plaques,
lysed in situ and subjected to PFGE with or without prior digestion with restriction
endonucleases. Southern blot is performed with a SEAP specific probe. U = undigested,
P = digested with PI-SceI, I = I-CeuI, E = EcoRI.

Figs. 4A and 4B show response of K562 and CD34+ cells, respectively, after infection
with ΔAd.AAVBG. Cells are incubated for 6 hours with virus under agitation. At day 3
after infection, transduction frequency is calculated based on the number of X-Gal
positive cells. Viability is tested by trypan blue exclusion. N = 3, SEM < 10%.
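
The transduction frequency and SEM reported for Figs. 4A and 4B can be illustrated with a short calculation. The following is a minimal sketch only: the function names and the cell counts are hypothetical and are not taken from the specification, and it assumes that the frequency is the fraction of counted cells that stain X-Gal positive and that the SEM is the sample standard deviation divided by the square root of the number of replicates.

```python
import math


def transduction_frequency(positive_cells, total_cells):
    """Percentage of counted cells that stain X-Gal positive (assumed definition)."""
    if total_cells <= 0:
        raise ValueError("total_cells must be positive")
    return 100.0 * positive_cells / total_cells


def mean_and_sem(values):
    """Mean and standard error of the mean (sample SD / sqrt(n))."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two replicates")
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / (n - 1)
    return mean, math.sqrt(variance / n)


# Hypothetical (positive, total) counts for N = 3 replicate infections.
replicates = [(120, 400), (132, 400), (126, 400)]
frequencies = [transduction_frequency(p, t) for p, t in replicates]
mean, sem = mean_and_sem(frequencies)
sem_percent_of_mean = 100.0 * sem / mean
print(f"transduction frequency: {mean:.1f}% (SEM {sem:.2f}, "
      f"{sem_percent_of_mean:.1f}% of the mean)")
```

Under these assumptions, a statement such as "SEM < 10%" would correspond to `sem_percent_of_mean` being below 10.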



Fig. 5 shows Rep expression in SKHep1 and 293 cells after plasmid transfection. 5x10
cells are transfected with pAAV/Ad, pRSVrep, or pPGKrep by Ca-phosphate
co-precipitation. Three days after transfection, cells are harvested. Lysates are separated on
a 10% PA gel, followed by Western blot with Rep specific antibodies (03-65169,
American Research Products), and developed with ECL (Amersham).

Fig. 6 shows detection of vector integration into AAVS1 by PFGE. 

Fig. 7 shows a strategy for creating a ΔAd.AAV hybrid vector capable of site-specific
integration. Arrows indicate promoters, (PA) = polyadenylation signal, Ψ = adenoviral
packaging signal.

Figs. 8A-8B show vectors for transduction studies with SNori as expression unit and
analysis of vector integration on genomic DNA from a small cell number. Analogous
vector sets can be generated with β-galactosidase (BG) or green fluorescent protein
(GFP) as reporter genes.

Fig. 9 shows a strategy for substituting the Ad5 fiber sequence by the heterologous fiber X
genes using recombination in E. coli.



Fig. 10 shows the expression of CAR and αv-integrins on test cells. For flow cytometry
analysis, HeLa, CHO, K562, and CD34+ cells were incubated with monoclonal anti-CAR
(RmcB, 1:400 dilution) or anti-αv-integrin antibodies (L230, 1:30 dilution). As a negative
control, cells were incubated with an irrelevant mouse monoclonal antibody (anti-BrdU,
1:100 dilution). The binding of primary antibodies was developed with anti-mouse
IgG-FITC labeled conjugates (1:100 dilution). Data shown represent the average results of
quadruplicate analyses performed on 10^4 cells.



Fig. 11 shows the electron microscopy of adenovirus particles. Purified particles from
Ad5, 9, and 35 were negative contrast stained and analyzed at a magnification of 
85,000x. Defective particles are highlighted by arrows. 

Fig. 12 shows the analysis of attachment and internalization of different serotypes to 
CHO, HeLa, K562, and CD34+ cells. Equal amounts of [3H]-thymidine-labeled virions
of Ads 3, 4, 5, 9, 35, and 41 (measured by OD260, and equivalent to an MOI of 400 pfu
per cell for Ad5) were incubated for one hour on ice as described in Materials and 
Methods. Cells were then washed, and the number of labeled virions bound per cell was 
determined. For internalization studies, viruses were first allowed to attach to cells for 1 h 
on ice. Then, unbound viral particles were washed out. Cells were then incubated at 37°C 
for 30 min followed by treatment with trypsin-EDTA and washing to remove 
uninternalized viral particles. The data were obtained from two to four independent 
experiments performed in triplicate. Note the different scale on the Y-axes for CD34+ 
cells. 

Figs. 13A-13C show attachment and internalization of different adenovirus serotypes to
HeLa, CHO and 293 cells, respectively.

Figs. 14A and 14B show attachment and internalization of different adenovirus serotypes
to CD34+ and K562 cells, respectively.

Figs. 15A-15C show the analysis of viral replication in K562 and CD34+ cells by
Southern blot analysis of methylated viral DNA. Replication studies were performed
with 1x10^5 K562 cells (A) or CD34+ cells (B), infected with methylated Ad5, Ad9 or
Ad35. The lane labeled as "load" represents DNA that was extracted from the media/cell
mixture immediately after adding the indicated viral dose to cells. The intensities of
bands corresponding to methylated and un-methylated viral DNA indicate that ~85% of
the input virus was methylated. To quantify adsorption and internalization, DNA analysis
was performed after prior incubation of virus with cells at 0°C (adsorption) or 37°C
(internalization). For dose dependent replication studies, the indicated viral dose
(expressed as the number of genomes) was added to the cells, and cellular genomic DNA
together with viral DNA was extracted 16 hours or 36 hours post-infection for K562 and
CD34+ cells, respectively. Identical amounts of sample DNA were analyzed by Southern
blot. For quantification purposes, Ad9 replication was analyzed together with Ad5 using
an Ad5/9 chimeric probe that hybridizes with DNA of both serotypes (C). The analysis of
Ad5 versus Ad35 replication was performed with the corresponding Ad5/35 chimeric
probe. Since separate hybridizations with both Ad5/35 and Ad5/9 probes gave identical
signal intensities for Ad5 DNA, only one panel is shown for Ad5 replication in test cells.
To produce distinguishable fragments specific for the methylated or non-methylated
status of viral genomes, Ad5 DNA was digested with Xho I, while Ad9 and Ad35 DNA
was digested with Xho I and Hind III. The bands specific for methylated (non-replicated)
viral DNA were ~12kb for Ad9, 35kb for Ad5, and ~12kb for Ad35. The fragments
specific for non-methylated DNA were 5.8kb for Ad9, 6.1kb for Ad5, and 9.5kb for
Ad35. Chimeric Ad5/9 and Ad5/35 DNA fragments (1.8kb) were used as quantification
standards and applied onto the gel together with digested viral/cellular DNA (shown on the
left part of the figures).



Figs. 16A-16B show the structure of Ad5GFP and chimeric Ad5GFP/F35 vectors. (A)
Schematic diagram of the original E1/E3 deleted Ad5-based vector with GFP-expression
cassette inserted into the E3 region (Ad5GFP) and the chimeric vector Ad5GFP/F35
containing the Ad5/35 fiber gene. The 2.2kb Ad5 fiber gene was replaced by a 0.9kb
chimeric fiber gene encoding the short shaft and knob of Ad35 by a technique that
involved PCR-cloning and recombination in E. coli. Kpn I (K) and Hind III (H) sites
localized within or around the fiber genes are indicated. The lower panel shows the
detailed structure of the chimeric fiber region. The Ad5 fiber tail [amino acids (aa): 1-44]
was joined in frame to the Ad35 fiber shaft starting from its first two amino acids (GV),
which are conserved among many serotypes. A conserved stretch of amino acids TLWT
marks the boundary between the last β-sheet of Ad35 shaft and the globular knob. The
Ad35 fiber chain termination codon is followed by the Ad5 fiber poly-adenylation signal.
The region of Ad5GFP/F35 encoding the chimeric fiber was completely sequenced with
Ad5 specific primers (see Material and Methods). (B) Restriction analysis of viral
genomes. Viral DNA was isolated from purified Ad5GFP and Ad5GFP/F35 particles as
described elsewhere. One microgram of DNA was digested with Hind III or Kpn I and
separated in ethidium bromide stained agarose gels (left panel) which were subsequently
blotted and analyzed by Southern blot with an Ad5 E4 specific probe (nt 32,775-33,651)
(right panel). Specific patterns, designating the correct structure for both viral vectors,
were detected. The Hind III fragments specific for Ad5GFP and Ad5GFP/F35 were 2.9kb
and 4.9kb, respectively. The Kpn I fragment that confirmed the correct Ad5GFP/F35
structure was 1.6kb compared to a 7.6kb Ad5GFP fragment. M = 1kb ladder (Gibco-BRL,
Grand Island, NY).

Fig. 17 shows the generation of ΔAd.AAV genomes by recombination between
inverted homology regions. Recombination between two inverted repeats (IR) present in
one Ad.AAV vector. The first-generation Ad.AAV vector (~34kb) contains two 1.2kb
inverted homology elements flanking the transgene cassette. One AAV-ITR is inserted
between the Ad packaging signal (Ψ) and the left IR. During Ad replication,
recombination between the IRs mediates the formation of the ΔAd.AAV genomes with the
transgene flanked by IRs, AAV ITRs, Ad packaging signals, and Ad ITRs. These
genomes are efficiently packaged into Ad capsids.

Fig. 18 shows the structure of Ad5/11, Ad5/35. Schematic diagram of the original E1/E3
deleted Ad5-based vector with GFP-expression cassette inserted into the E3 region
(Ad5GFP) and the chimeric vector Ad5GFP/F35 containing the Ad5/35 fiber gene. The
2.2kb Ad5 fiber gene was replaced by a 0.9kb chimeric fiber gene encoding the short
shaft and knob of Ad35 by a technique that involved PCR-cloning and recombination in
E. coli. Kpn I (K) and Hind III (H) sites localized within or around the fiber genes are
indicated. The lower panel shows the detailed structure of the chimeric fiber region. The
Ad5 fiber tail [amino acids (aa): 1-44] was joined in frame to the Ad35 fiber shaft
starting from its first two amino acids (GV), which are conserved among many serotypes.
A conserved stretch of amino acids TLWT marks the boundary between the last β-sheet
of Ad35 shaft and the globular knob. The Ad35 fiber chain termination codon is
followed by the Ad5 fiber poly-adenylation signal.

Fig. 19 shows the cross-competition for attachment and internalization of labeled
Ad5GFP, Ad35, and chimeric Ad5GFP/F35 virions with unlabeled viruses, and with anti-CAR
or anti-αv-integrin MAb. (A) For attachment studies, 10^5 K562 cells were pre-incubated
with a 100-fold excess of unlabeled competitor virus at 4°C for 1 h. Then,
equal amounts of [3H]-labeled viruses, at a dose equivalent to an MOI of 100 pfu per cell
determined for Ad5GFP, were added to cells followed by incubation at 4°C for 1 h. Cells
were then washed with ice-cold PBS, pelleted and the percentage of attached virus
(cell-associated counts per minute) was determined. For analysis of cross-competition for
internalization, cells were pre-incubated with a 100-fold excess of competitor virus at
37°C for 30 min before labeled virus was added. After an additional incubation at 37°C
for 30 min, cells were treated with trypsin-EDTA for 5 min at 37°C, washed with ice-cold
PBS, pelleted, and the percentage of internalized virus was determined. For controls, cells
were incubated with labeled viruses without any competitors. Preliminary experiments
had shown that the conditions chosen for competition studies allowed for saturation in
attachment/internalization on K562 cells for all unlabeled competitors. (B) 10^5 K562 cells
were pre-incubated for 1 hour at 4°C with anti-CAR MAb (RmcB, diluted 1:100) or with
anti-αv-integrin MAb (L230, diluted 1:30), followed by incubation with labeled viruses
according to the protocols for attachment or for internalization as described above. For
each particular serotype, the percentage of attached/internalized virus was compared to
the control settings, where cells were preincubated under the same conditions with a
1:100 dilution of an irrelevant antibody (anti-BrdU MAb) before addition of the labeled
virus. Note that the specific competitors but not the corresponding controls significantly
inhibited Ad5 internalization to a degree that is in agreement with published data (59).
N≥4.
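
The comparison of competitor-treated samples with the control settings in Fig. 19 amounts to a simple normalization of cell-associated counts. The sketch below is illustrative only: the function name, the optional background correction, and the counts-per-minute values are hypothetical and do not come from the specification. It assumes the result is expressed as a percentage of the no-competitor (or irrelevant-antibody) control.

```python
def percent_of_control(sample_cpm, control_cpm, background_cpm=0.0):
    """Cell-associated counts as a percentage of the control setting.

    background_cpm is an optional, assumed correction for counts measured
    without labeled virus; it defaults to zero.
    """
    corrected_control = control_cpm - background_cpm
    if corrected_control <= 0:
        raise ValueError("control counts must exceed background")
    return 100.0 * (sample_cpm - background_cpm) / corrected_control


# Hypothetical counts per minute for one labeled virus on K562 cells.
control_cpm = 5000.0          # labeled virus, no competitor
with_competitor_cpm = 750.0   # labeled virus after pre-incubation with competitor

attached = percent_of_control(with_competitor_cpm, control_cpm)
inhibition = 100.0 - attached
print(f"attached: {attached:.1f}% of control, inhibition: {inhibition:.1f}%")
```

The same normalization would apply to the internalization protocol, with trypsin-resistant counts in place of total cell-associated counts.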

Fig. 20 shows the cross-competition for attachment and internalization of [3H]-labeled
Ad5GFP, Ad35, and chimeric Ad5GFP/F35 virions with unlabeled Ad3 virus (A), and of
[3H]-labeled Ad3 virions with unlabeled viruses (B). 10^5 K562 cells were pre-incubated with
a 100-fold excess of unlabeled viral particles according to the attachment or internalization
protocols described for Fig. 19. Equal amounts of [3H]-labeled Ad5GFP, Ad5GFP/F35, or
Ad35 (A) or [3H]-labeled Ad3 (B) were added to cells at a dose equivalent to an MOI of
100 pfu per cell for Ad5GFP. In control settings, cells were incubated with labeled
viruses without any competitors. N=4.

Fig. 21 shows the transduction of CD34+, K562, and HeLa cells with Ad5GFP and
chimeric Ad5GFP/F35 vectors. 1x10^5 cells were infected with different MOIs (pfu/cell)
of viruses in 100 μl of media for 6 hours at 37°C. Virus containing media was then
removed, and the cells were resuspended in fresh media followed by incubation for 18 h
at 37°C. The percentage of GFP expressing cells was determined by flow cytometry. N=3.



Fig. 22 shows the distribution of GFP-positive cells in subpopulations of human CD34+
cells expressing CAR or αv-integrins. 1x10^5 CD34+ cells were infected with Ad5GFP or
Ad5GFP/F35 at an MOI of 200 pfu/cell as described for Fig. 21. Twenty-four hours after
infection, cells were incubated with anti-CAR (1:100 final dilution) or anti-αv-integrin
(1:30 final dilution) primary MAbs for 1 h at 37°C. Binding of primary antibodies was
developed with anti-mouse IgG-PE labeled secondary MAbs (1:100 final dilution) at 4°C
for 30 min. For each variant, 10^4 cells were analyzed by flow cytometry. The mock
infection variants represent cells incubated with virus dilution buffer only. The quadrant
borders were set based on the background signals obtained with both the GFP- and
PE-matched negative controls. The percentages of stained cells found in each quadrant are
indicated. The data shown are representative of three independent experiments.


Figs. 23A-23B show the distribution of GFP-positive cells in a subpopulation of human
CD34+ cells expressing CD34 and CD117 (c-kit). (A) Co-localization of GFP
expression with CD34 or CD117: CD34+ cells were infected with Ad5GFP or
Ad5GFP/F35 at an MOI of 200 pfu per cell under the conditions described for Fig. 21.
Twenty-four hours after infection, cells were incubated with anti-CD34 PE-conjugated
MAbs (final dilution 1:2) or with anti-CD117 PE-conjugated MAbs (final dilution 1:5)
for 30 min on ice, and 10^4 cells per variant were subjected to two-color flow cytometry
analysis. For negative control staining, no antibodies were added to the cells before
analysis. The mock infection variants represent cells incubated with virus dilution buffer
only. The quadrant borders were set based on the background signals obtained with both
the GFP- and PE-matched negative controls. The percentages of stained cells found in
each quadrant are indicated. The experiment was performed two times in triplicate, and
typical results are shown. The SEM was less than 10% of the statistical
average. (B) Transduction of CD34+/CD117+ cells with Ad5GFP and chimeric
Ad5GFP/F35 virus vectors: CD34+ cells, cultured overnight before staining in media
without SCF, were incubated with PE-labeled anti-CD117 MAb for 30 min on ice. The
fraction of CD117-positive cells was sorted by FACS. More than 97% of sorted cells
were positive for CD117. 1x10^5 CD117+/CD34+ cells were infected with Ad5GFP or
Ad5GFP/F35 at an MOI of 200 pfu per cell, as for Fig. 21. Twenty-four hours post
infection, the percentage of GFP positive cells was determined by flow cytometry. For mock
infection, CD117+/CD34+ cells were incubated with virus dilution buffer only. The
infections were done in triplicate, and the average percentage of GFP-expressing cells is
indicated on the corresponding histogram. The SEM was less than 10% of the statistical
average.

Fig. 24 shows the Southern analysis of viral genomes in GFP-positive and GFP-negative
fractions of CD34+ cells infected with the Ad5GFP and chimeric Ad5GFP/F35 vectors.
CD34+ cells were infected with viruses at an MOI of 100 as described for Fig. 21.
Twenty-four hours post infection, cells were sorted by FACS for GFP positive and GFP negative
fractions. 10^5 cells from each fraction were used to isolate genomic DNA together with
viral DNA. Before cell lysis, a rigorous treatment with trypsin and DNase followed by
washing was performed to exclude contamination of genomic DNA samples by
extracellular viral DNA. (A) The upper panel shows the ethidium bromide stained 1%
agarose gel before blotting, demonstrating that similar amounts of genomic DNA were
loaded. This amount corresponded to DNA isolated from ~25,000 GFP+ or GFP- cells.
The lane labeled "load" represents viral DNA purified from Ad5GFP or Ad5GFP/F35
virions mixed with pBluescript plasmid DNA (Stratagene) as a carrier and applied on a
gel at the amount that was actually used to infect 25,000 cells. As a concentration
standard, a serial dilution of Ad5GFP genomes was loaded on the gel (left side). For
Southern analysis (lower panel), an 8kb-long HindIII fragment corresponding to the E2
region of Ad5 was used as a labeled probe. Hybridized filters were subjected to
PhosphorImager analysis and then exposed to Kodak X-OMAT film for 48 h at -70°C.
The cellular/viral genomic DNA is indicated by an arrow. (B) To detect Ad5GFP
genomes in transduced cells, PCR amplification followed by Southern blot hybridization
was performed on the same samples that were used for quantitative Southern blot
hybridization in (A). DNA purified from ~2,500 cells was subjected to PCR (95°C-1 min,
53°C-1 min, 72°C-1 min, 20 cycles with primers Ad5-F1 and Ad5-R1). One fifth of the
PCR reaction was subjected to agarose gel electrophoresis (upper panel). A 0.9 kb-long
DNA fragment, specific to the E4 region of Ad5, was detected for transduced
Ad5GFP/F35 genomes. DNA was then blotted onto Hybond-N+ membrane and Southern
blot hybridization (lower panel) with an Ad5 E4 specific DNA probe was performed. In
addition to the 0.9kb DNA fragment, the PCR primers generated a smaller 0.5 kb-long
fragment that also hybridized with the E4 region probe.

Fig. 25 shows the role of fiber shaft length in Ad infection strategies. CAR binding (Ad5 
and Ad9) variants and Ad35, which interacts with a non-CAR receptor were analyzed on 
20 CAR expressing cells (293, Y79) and K562 cells which do not express significant CAR 
amounts. All vectors contain a GFP expression cassette packaged into an Ad5 capsid 
with modified fibers. 

Fig. 26 shows the tertiary structure of Ad5 knob: localization of CAR binding sites, H-I
loop and G-H loop.

Fig. 27 shows the substitution of the G-H loop with heterologous peptides. 

Fig. 28 shows the attachment and internalization of metabolically labeled serotypes with
human cell lines.

Figs. 29A-29D show the generation of Rep78-expressing Ad vectors by recombination between
two vectors. (A) The same strategy outlined in Fig. 15 was employed for vectors with rep78
as a transgene. The Ad5'rep vector also contained the ApoEhAAT promoter shielded by an HS-4
insulator. The region of homology between the two fragments of the rep78 gene was 658 nt in
length. The p5 promoter was deleted from the Rep78 ORF. The internal Rep 40/52 start codon
(at position 993) was mutated to abolish production of the small Rep proteins. Furthermore,
the splice site at nt 1905 was deleted, eliminating production of Rep68. The individual
expression of Rep78 was demonstrated. (B) Formation of AAd.rep78 genomes. The expected
5.8 kb AAd.rep78 genome was observed only upon coinfection of 293 cells with both Ad5'rep
and Ad3'rep, as demonstrated by Southern blot. (C) Southern blot analysis for rescue of the
recombinant AAV genome from plasmid DNA by Rep78 expressed from pCMVrep78 and AAd.rep78.
The expected rescue product is 3.8 kb (R-plasmid). (D) Southern blot analysis for rescue
of the recombinant AAV genome from Ad.AAV viral vector genomes.


DETAILED DESCRIPTION OF THE INVENTION 

All scientific and technical terms used in this application have meanings commonly used in
the art unless otherwise specified. As used in the present invention, the following words
or phrases have the meanings specified.

The term "vector" includes, but is not limited to, plasmids, cosmids, phagemids, and
artificial chromosomes. The vector sequence may be designated as the viral "base vector"
sequence. The base vector sequence depends on the particular type of virus and serotype
from which it was derived. The base vector sequence may be linked to non-vector or
transgene sequences (e.g., heterologous sequences).

The transgene sequences may include sequences that confer compatibility with
prokaryote or eukaryote host cells, where compatibility relates to vector replication
within a host cell. Accordingly, the transgene sequence may be a replicon sequence that
directs replication of the vector within the host cell, resulting in an autonomously
replicating vector. Alternatively, the transgene sequence may permit vector replication
that is dependent upon the host cell's replication machinery.



The base vector sequences may be operatively linked to a transgene sequence that
encodes a gene product, such as a polypeptide, rRNA, or tRNA. For example, the
transgene may encode a polypeptide such as a viral capsid protein or a viral fiber protein.
The transgene may be derived from the same serotype as the base vector sequence or from a
different serotype.

Another example of a transgene is a reporter gene that encodes a gene product that
can be used as a selectable marker, such as drug resistance or a colorimetric marker. The
reporter gene may encode a gene product which can be readily detected by, for example,
a visual, microscopic, immunochemical, or enzymatic assay. The preferred reporter gene
encodes a gene product that can be detected by a non-destructive method, i.e., one that
does not destroy the cell that expresses the reporter gene.

A therapeutic gene is another example of a transgene. A therapeutic gene encodes a gene
product (e.g., a polypeptide or RNA) which, when expressed in a host cell, provides a
therapeutic benefit or desired function to the host cell or to the tissue, organ, or
organism containing the host cell. The therapeutic benefit may result from modifying a
function of a gene in the host genome or from the additional function provided by the
therapeutic protein, polypeptide or RNA.



The base vector sequence may be linked to a transgene sequence that is a regulatory
element, such as a promoter, an enhancer, a transcription termination signal, or a
polyadenylation sequence. The regulatory element may direct expression of the
transgene sequence that encodes a gene product by directing transcription or translation.
The regulatory element may regulate the amount or timing of expression of the transgene
sequence. The regulatory element may direct expression of the transgene in certain host
cells or tissues (e.g., host-specific or tissue-specific expression).

The base vector sequence may be linked to a transgene sequence that permits the vector to
integrate into another nucleotide sequence. The integration sequence may direct
integration of the whole vector or of portions of the vector. The integration sequence may
or may not be related to the base vector sequence. For example, the integration and base
vector sequences may be from the same or different viral serotypes. The integration
sequence may be inverted terminal repeat sequences (ITRs) from adenovirus (Ad),
adeno-associated virus (AAV), or HIV.

The base vector sequence may be linked to a transgene sequence that directs homologous 
recombination of the vector into the genome of a host cell. Such transgene sequences 
may or may not be from the same viral serotype as the base vector sequence. 



The vector may be used to transport the heterologous sequence into a host cell or into a 
host cell's genome. 

The vector may comprise multiple endonuclease restriction sites that enable convenient
insertion of exogenous DNA sequences.

The term "hybrid vector" as used in the invention refers to a vector which comprises a
nucleic acid sequence combined from two different viruses (e.g., adenovirus and AAV).

"Chimeric vector" refers to a vector which contains nucleic acid sequences that are
unnatural to the base vector (i.e., sequences not occurring naturally or sequences not in
their natural background, including heterologous sequences). A chimeric vector as used
in the invention may also be a hybrid vector. An example of a chimeric vector is
Ad.AAV expressing a modified fiber protein on its capsid.

The term "transduction" or "infection" refers to a method of introducing viral DNA
within a virus particle into a host cell. The viral DNA herein is in the form of
recombinant virus, which is generated by linking a segment of DNA of interest into the
viral genome in such a way that the gene can be expressed as a functional protein.


The term "transfection" refers to a method of introducing a DNA fragment into a host 
cell. 

The term "heterologous" as used herein means that a nucleic acid or peptide sequence is
placed in a context that is not endogenous to the base adenovirus vector or to a
transduced cell. For example, a peptide sequence can be transferred from one protein to
another protein; the resulting protein is referred to herein as a heterologous protein. A
chimeric fiber protein (e.g., a serotype 5 tail domain and a serotype 35 shaft and knob
domain) is considered "heterologous" to the Ad5 vector. The term also includes
nucleic acids (e.g., coding sequences) from one strain or serotype of adenovirus
introduced into a different strain or serotype of adenovirus.

The term "regulatory elements" is intended to include promoters, enhancers, transcription
termination signals, polyadenylation sequences, and other expression control sequences.
Regulatory elements referred to in the invention include, but are not limited to, those
which direct expression of a nucleic acid sequence only in certain host cells (e.g.,
tissue-specific regulatory sequences).

The term "operably linked" indicates that a polynucleotide sequence (e.g., a coding
sequence or gene) is linked to a regulatory element in such a way that the regulatory
element sequence controls and regulates the transcription or translation, or both, of that
polynucleotide sequence. The orientation of the regulatory element may vary (e.g., be in
reverse orientation with respect to the right ITR). The term also includes having an
appropriate start signal (e.g., ATG) in front of the polynucleotide sequence to be
expressed and maintaining the correct reading frame to permit expression of the
polynucleotide sequence under the control of the expression control sequence and
production of the desired polypeptide or protein. Regulatory sequences can also include
3' sequences which ensure correct termination (e.g., a polyadenylation stop signal).

The term "gene therapy" as used herein refers to a method which introduces a segment of
exogenous nucleic acid into a cell in such a way that it results in functional modification
of the recipient cell by expression of the exogenous nucleic acid. The exogenous nucleic
acid is typically therapeutic in that the expression of the encoded protein, polypeptide or
RNA corrects cellular dysfunction due to a genetic error or, more generally, counteracts
any undesirable functions which are associated with a genetic or acquired disease. The
term "exogenous nucleic acid" refers to DNA or RNA sequences not normally expressed
in the treated transformed cell. The term also refers to DNA and RNA sequences which
are expressed in a treated transformed cell at a higher or lower level, or in an otherwise
different pattern, than in the untreated, non-transformed cell. This non-natural
expression can also be termed heterologous expression.


A "gene therapy vector" refers to a vector used for gene therapy, i.e. to introduce the 
exogenous nucleic acid into a recipient or host cell. The exogenous nucleic acid may be 
transiently expressed or integrated and stably expressed in the recipient or host cell. 

The term "plasmid" as used herein refers to any nucleic acid molecule which replicates
independently of the host, maintains a high copy number, and which can be used as a
cloning tool.

The terms "parallel strand of DNA" and "anti-parallel strand of DNA" refer to the two
strands of the double-stranded adenovirus DNA. The Figures diagram the location
of certain nucleotides on the parallel strand of DNA. The anti-parallel strand of DNA
refers to the other of the two strands of DNA, which is not depicted in the Figures. The
fiber protein is encoded on the anti-parallel strand of DNA. To simplify the vector
diagrams, the fiber sequences are shown on the parallel strand even though the gene is
located on the anti-parallel strand.

The term "reporter gene" refers to any nucleic acid sequence which encodes a
polypeptide or protein which can be readily detected by, for example, a visual,
microscopic, immunochemical or enzymatic assay. Preferred reporter genes are those
that can be detected by a non-destructive method that does not destroy the treated,
transformed cells or tissue.

The term "selection gene" used herein refers to any nucleic acid fragment which encodes 
a polypeptide or protein whose expression is used to mark a cell as a transformed cell by 
a given vector. 

The term "therapeutic gene" refers herein to a DNA fragment encoding a functional 
polypeptide, protein or RNA, which when expressed in a host cell provides a therapeutic 
benefit or desired function to the host cell or to the organ or organism containing the host 
cell. The therapeutic benefit may result from modification of a function of a native gene 
in a host or from the additional function provided by the therapeutic protein, polypeptide 
or RNA. 

The term "host tissue" or "host cell" as used herein, refers to a tissue or cell in which a 
therapeutic gene is to be expressed to modify its function. 

It is well known in the biological arts that certain amino acid substitutions may be made
in protein sequences without affecting the function of the protein. Generally,
conservative amino acid substitutions or substitutions of similar amino acids are tolerated
without affecting protein function. Similar amino acids can be those that are similar in
size and/or charge properties; for example, aspartate and glutamate, and isoleucine and
valine, are both pairs of similar amino acids. Similarity between amino acid pairs has
been assessed in the art in a number of ways. For example, Dayhoff et al. (1978) in Atlas
of Protein Sequence and Structure, Volume 5, Supplement 3, Chapter 22, pp. 345-352,
which is incorporated by reference herein, provides frequency tables for amino acid
substitutions which can be employed as a measure of amino acid similarity. The frequency
tables of Dayhoff et al. are based on comparisons of amino acid sequences for proteins
having the same function from a variety of evolutionarily different sources. Therefore,
any obvious changes in the amino acid sequences (as described above) to the sequences
of the invention are already contemplated.

Polypeptides which are "substantially similar" share sequences as noted above except that
residue positions which are not identical may differ by conservative amino acid changes.
Conservative amino acid substitutions refer to the interchangeability of residues having
similar side chains. For example, a group of amino acids having aliphatic side chains is
glycine, alanine, valine, leucine, and isoleucine; a group of
amino acids having aliphatic-hydroxyl side chains is serine and threonine; a group of
amino acids having amide-containing side chains is asparagine and glutamine; a group of
amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a
group of amino acids having basic side chains is lysine, arginine, and histidine; a group
of amino acids having sulfur-containing side chains is cysteine and methionine.
Preferred conservative amino acid substitution groups include, but are not limited to:
valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine,
asparagine-glutamine, and aspartate-glutamate. Therefore, polypeptide substitutions
yielding sequences "substantially similar" (as described above) to the amino acid
sequences described in the invention are already contemplated.
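
By way of illustration only, the preferred conservative substitution groups listed above
can be expressed as a simple lookup. The following Python sketch is not part of the
disclosed methods: the names PREFERRED_GROUPS, is_conservative and substantially_similar
are hypothetical, and the equal-length, position-by-position comparison is a simplifying
assumption (no alignment or gaps are considered).

```python
# Illustrative sketch only. The groups are the "preferred" substitution groups
# named in the text, written in one-letter amino acid codes. All identifiers
# here are hypothetical and chosen for this example.
PREFERRED_GROUPS = [
    set("VLI"),  # valine-leucine-isoleucine
    set("FY"),   # phenylalanine-tyrosine
    set("KR"),   # lysine-arginine
    set("AV"),   # alanine-valine
    set("NQ"),   # asparagine-glutamine
    set("DE"),   # aspartate-glutamate
]


def is_conservative(a: str, b: str) -> bool:
    """Return True if residues a and b are identical or share a preferred group."""
    if a == b:
        return True
    return any(a in group and b in group for group in PREFERRED_GROUPS)


def substantially_similar(seq1: str, seq2: str) -> bool:
    """Return True if two equal-length sequences differ only by conservative changes.

    Assumption: sequences are compared position by position without alignment.
    """
    if len(seq1) != len(seq2):
        return False
    return all(is_conservative(a, b) for a, b in zip(seq1, seq2))
```

Under these assumptions, substantially_similar("AVKD", "VLRE") holds because each
position differs only within a listed group, whereas a lysine-to-aspartate change
does not qualify.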

In order that the invention herein described may be more fully understood, the following 
description is set forth. 

The present invention provides unique gene transfer vehicles which overcome many of
the limitations of prior art vectors. The invention describes first generation adenovirus
vectors comprising left and right Ad ITRs, an Ad packaging sequence, a transgene
cassette with regulatory elements, and a pair of cassette ITRs flanking the transgene
cassette that direct predictable viral genomic rearrangements during viral replication as
well as the integration of the transgene cassette into the host cell genome.

One predictable rearrangement that occurs during viral replication is the generation of a
gutless adenovirus vector (also referred to herein as AAd) that comprises right and left Ad
ITRs, an Ad packaging sequence, and a transgene cassette flanked by cassette ITRs, and
that is devoid of all other immunogenic viral genes.


The potential for site-specific integration is an important characteristic of the novel Ad
vectors of the invention. In an embodiment of the invention, integration of the transgene
cassette is directed by co-infection with an Ad vector expressing, e.g., the Rep78 protein
to achieve site-specific integration in, e.g., the AAVS1 site on human chromosome 19.


The invention further describes a novel way of targeting these recombinant adenovirus
vectors to selected cells by modifying the adenovirus fiber protein that is expressed on
the capsid. Changes to both the fiber shaft and the fiber knob domains successfully
retargeted the Ad vector to a desired cell type. In addition, the G-H loop within
the fiber knob domain is identified as a novel site that affects the binding affinity and
specificity of the recombinant adenovirus vector. Substitution of peptide sequences into
the G-H loop retargets the gutless vector to a desired cell type.

An adenovirus display library has been generated that expresses random peptides within
the G-H loop of the fiber protein. This type of library is used to screen for adenovirus
vectors whose displayed peptide ligands bind to desired cell types. One advantage of using
an adenovirus display library rather than a phage display library is that once adenovirus
affinity for a desired cell is identified, the targeted adenovirus vector is ready to
accept a transgene cassette and can be used to generate a gutless adenovirus vector, for
example for use in gene therapy.


The chimeric Ad vectors described below contain a modified fiber protein in the capsid
of the adenovirus which renders the vector capable of infecting desired cell types.
Therefore, according to the invention, a gutless chimeric AAd.AAV vector can be
generated to introduce any transgene(s) into any host cell or tissue which is normally
refractory to most commonly used gene therapy viral vectors. In addition, the chimeric
AAd.AAV vector of the invention is devoid of adenoviral genes and contains AAV ITR
sequences that flank the transgene cassette, which direct stable transgene integration
into the host genome, allowing long-term expression of the transgene.

The transgene cassette described in the invention may carry a transgene which is either a
reporter gene, a selectable gene for in vitro or in vivo selection of transduced cells, or a
therapeutic gene. In one embodiment of the invention the reporter transgene can be, but is
not limited to, β-galactosidase. Many reporter genes are commonly used in the art, any of
which could be carried as a transgene in the Ad.AAV vector of the invention. Other
examples of reporter genes are GFP and alkaline phosphatase.


The following describes (a) an embodiment of the first generation Ad vectors of the
invention having a wild-type capsid and a transgene cassette flanked by cassette ITR
sequences; (b) a fiber protein that is modified to retarget Ad vectors; and (c) the
combination of both technologies, which enables the production of a chimeric AAd vector
including a modified fiber protein expressed on the capsid that retargets the base vector
to a desired cell type for infection and transgene integration.

A. Integrating Ad hybrid vectors of the invention: 

It has been shown that inverted repeats (IRs) inserted into the E1 region of AdE1- vectors
can mediate predictable genomic rearrangements resulting in a gutless vector genome
devoid of all viral genes. A specific embodiment of such IR-mediated rearrangements is
the Adeno-AAV, a first generation hybrid adenovirus vector containing AAV inverted
terminal repeats (ITRs) flanking a transgene cassette. The AAV ITRs mediate the
formation of a genome similar to that of the AAd.IR genome (Steinwaerder et al., 2000
Journal of Virology). AAd vectors devoid of all viral genes stably integrate and transduce
cultured cells with efficiencies comparable to those of, e.g., rAAV vectors. The Examples
demonstrate by Southern blot analysis that the AAd vectors integrate randomly into the
host genome.

The Ad vectors of the invention comprise a left Ad ITR; an adenovirus packaging
sequence located 3' to the Ad ITR; and a transgene cassette located 3' to the packaging
sequence, comprising a polyadenylation signal, a transgene, and a heterologous promoter,
and flanked by a pair of cassette ITRs. Adenoviral genes used for replication, such as E1,
E2, E3 and E4, are located 3' to the right cassette ITR, and a right Ad ITR is located 3'
to the replication genes. The vectors of the invention are particularly suited to treat
genetic disorders, cancers, and infectious diseases (such as HIV, emboli, or malaria).
Genetic diseases such as hemophilia A and B, cystic fibrosis, muscular dystrophy, and
α1-antitrypsin disorder are ideal candidates for treatment with vectors of the invention.
A specific example of a therapeutic gene to combat a genetic disorder is gamma-globin to
ameliorate sickle cell anemia.
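
For illustration only, the 5' to 3' element order recited above can be summarized as an
ordered list. In the following Python sketch, the Element type, its field names and the
helper functions are hypothetical. The gutless() helper models only the stated composition
of a gutless genome (Ad ITRs, packaging sequence, and ITR-flanked transgene cassette); it
is an assumption-laden shorthand and does not model the rearrangement that produces it.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Element:
    """One genome element; viral_gene marks adenoviral replication genes."""
    name: str
    viral_gene: bool = False


# 5' -> 3' order as recited in the text (names are illustrative labels).
FIRST_GENERATION = [
    Element("left Ad ITR"),
    Element("Ad packaging sequence"),
    Element("left cassette ITR"),
    Element("transgene cassette"),  # poly(A) signal, transgene, heterologous promoter
    Element("right cassette ITR"),
    Element("Ad replication genes", viral_gene=True),
    Element("right Ad ITR"),
]


def gutless(layout: List[Element]) -> List[Element]:
    """Return the layout without viral genes (composition only, not mechanism)."""
    return [e for e in layout if not e.viral_gene]


def is_three_prime_of(layout: List[Element], a: str, b: str) -> bool:
    """Return True if element a lies 3' to element b in the layout."""
    names = [e.name for e in layout]
    return names.index(a) > names.index(b)
```

In this representation the gutless genome is simply the same ordered list with the
replication genes omitted, which matches the composition described for AAd vectors.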



To aid in the selection of transduced cells and to characterize the integration site of the
transgene cassette, an embodiment of the invention includes the addition of a sequence
comprising a bacterial origin of replication plus a selectable gene. An
embodiment of this is an SNori sequence added to the transgene cassette. This allows the
AAd to be expressed in human and bacterial cells, thereby allowing selection of the
transduced cells and characterization of the integration site in the genome of transduced
mammalian cells.

The potential for site-specific integration is an important characteristic of the novel Ad
vectors of the invention. In an embodiment of the invention, integration of the AAd.AAV
is directed by co-infection with Ad.AAV expressing the Rep78 protein in 293 cells to
achieve site-specific integration in the AAVS1 site on human chromosome 19. For this
type of site-specific integration to occur in cells other than 293 cells, E4 ORF6
expression is required. The co-infection of AAd.AAV, AAd.rep78, and AAd.E4-orf6
allows for site-specific integration of the AAd.AAV transgene cassette. The AAd.rep78
and the AAd.E4-orf6 genomes are degraded soon after transduction, thus avoiding
potential side effects. Site-specific integration is preferred over random integration,
which is seen with rAAV and AAd.AAV, in order to reduce the risk of insertional
mutagenesis.

Integration of the transgene cassette contained in the adenoviral vectors into
chromosomes may be associated with silencing (or blocking) of transgene expression.
The silencing of transgenes can be overcome by adding insulator elements to the
transgene cassette. For example, HS-4 insulator elements derived from the chicken
β-globin LCR can function in Ad vectors to shield heterologous promoters from adenoviral
enhancers. HS-4 insulators or the Drosophila Gypsy gene can also be used to prevent
silencing of transgenes.

Another embodiment of the invention is to split the transgene cassette into two portions
of the transgene, each carried in a different recombinant adenoviral vector of the
invention. Each portion of the same transgene has an overlapping region of homology.
After infection with both vectors, each carrying a different but overlapping portion of
the same transgene, a homologous recombination event occurs, resulting in the
reconstitution of the complete transgene, which is then expressed. This technique is used
to produce hybrid adenoviral vectors that accommodate large inserts including, but not
limited to, a 13 kb genomic hAAT gene or a 12 kb γ-globin LCR-γ-globin expression
cassette for ameliorating sickle cell anemia (or correcting γ-globin mutations). The
formation of the hybrid AAd vector genomes, after recombination between two vectors, is
more efficient if the overlapping region of homology within the transgene is longer.
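
The reconstitution of a complete transgene from two overlapping portions can be pictured
with a string-level analogy. The Python sketch below is illustrative only: the function
name reconstitute, the min_overlap parameter and the use of exact suffix/prefix matching
are assumptions made for this example, and the sketch does not model recombination
efficiency or any biological mechanism.

```python
from typing import Optional


def reconstitute(five_prime: str, three_prime: str, min_overlap: int = 1) -> Optional[str]:
    """Join two fragments at their longest exact suffix/prefix overlap.

    Returns the joined sequence, or None if the shared region is shorter than
    min_overlap. min_overlap is a hypothetical threshold standing in for the
    requirement that the two portions share a region of homology.
    """
    max_len = min(len(five_prime), len(three_prime))
    # Try the longest candidate overlap first, then progressively shorter ones.
    for k in range(max_len, min_overlap - 1, -1):
        if five_prime.endswith(three_prime[:k]):
            return five_prime + three_prime[k:]
    return None


# Example with an arbitrary placeholder sequence (not a real transgene).
full = "ATGGCCATTGTAATGGGCCGCTGAAAGGGTGCCCGATAG"
left_portion = full[:25]    # carried by the first vector in this analogy
right_portion = full[15:]   # carried by the second vector; shares 10 nt with the first
rebuilt = reconstitute(left_portion, right_portion, min_overlap=8)
```

Here the two portions share ten nucleotides, so the join succeeds with a threshold of
eight and fails if a longer shared region is demanded.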



An advantage of the present invention is a method to rapidly isolate pure gutless hybrid
adenoviral vectors such as AAd.AAV or AAd.AAVfx vectors. To minimize the
contamination of AAd with first generation vectors (Ad vectors), a strategy is described in
Example I H. It is anticipated that these approaches will yield the same titer of AAd
vectors; however, the contamination with full-length genome vectors will be lower. This
improved isolation of the vectors is extremely important to avoid toxic side effects after
in vivo application.

B. Tropism modified adenovirus vectors:

The Ad vectors of the invention can be modified so that they target a host cell of interest.
There are more than 50 human Ad serotypes (Appendix I), including variants with
different tissue selectivity or tropism. It is accepted in the art that different Ad serotypes
bind to different cellular receptors and use different entry mechanisms. Most recombinant
adenovirus vectors use adenovirus serotype 5 (Ad5) as the base vector (Hitt,
M.M., et al., 1997, Adv. in Pharmacology 40, 137-205). Ad5 infection is primarily
mediated by its fiber protein binding to CAR and secondarily by its penton base protein
binding to integrin. Due to the lack of CAR and/or integrin expression on many cell and
tissue types, Ad5-mediated gene transfer is inefficient in a number of tissues which are
important targets for gene therapy, such as endothelia, smooth muscle, skin epithelia,
differentiated airway epithelia, brain tissue, peripheral blood cells, or bone marrow. The
following describes Ad5 vectors of the invention having a change in infectivity and
tropism as a result of altering the fiber protein sequence.

The infectivity of different Ad serotypes is limited to a number of human cell lines.
Infectivity studies revealed that Ad5 and Ad3 are particularly suitable for infecting and
targeting endothelial or lymphoid cells, whereas Ad9, Ad11 and Ad35 efficiently infected
human bone marrow cells. Therefore, the knob domains of the fiber proteins of Ad9, Ad11
and Ad35 are excellent candidates for retargeting the Ad5 vector to human bone marrow
cells. Other possible serotypes include Ad7.

In the modified fiber protein of the invention the fiber knob domain of the Ad5 fiber has
been replaced with the fiber knob domain of another Ad serotype. An embodiment of the
invention is the modified Ad5/35 fiber protein (a recombinant Ad5 vector expressing a
modified fiber protein comprising the fiber tail domain of Ad5 and the fiber shaft and
knob domains of Ad35). The Ad5/35 chimeric fiber protein shows a broader spectrum of
infection of a subset of CD34+ cells, including those with stem cell activity. The Ad5/11
chimeric fiber protein (a recombinant Ad5 vector expressing a modified fiber protein
comprising the fiber tail domain of Ad5 and the fiber shaft and knob domains of Ad11)
showed similar tropism.

In addition to the knob domain modifications, the invention describes the added
advantage of modifying both the fiber shaft domain and the fiber knob domain to produce
a shortened fiber protein. The length of the fiber shaft domain plays a key role in the host
receptors used for viral vector entry into the host cell. To show this, Ad5, Ad5/9, and
Ad5/35 variants were constructed with long-shafted (22 β-sheets) and short-shafted
(7 β-sheets) fibers. These analyses demonstrated that efficient viral infection involving
CAR as the primary receptor for Ad5 and Ad5/9 requires a long-shafted fiber protein,
whereas the cell entry strategy of Ad5/35 (which binds to a still uncharacterized non-CAR
receptor) does not depend on the shaft length (Fig. 3). The modification of both the fiber
shaft domain length (between 5 and 10 β-sheets) and the fiber knob domain (from a different
Ad serotype than the base vector) is a novel mode of altering Ad vector tropism.

To broaden the repertoire of cell types that Ad vectors can infect, a specific binding
region, the G-H loop, within the knob domain has been newly identified herein to
improve binding affinity and specificity. Alteration within this region will redirect the Ad
vector to a desired cell type. For example, the invention describes the G-H loop sequence
within the fiber protein knob domain, which can be replaced with heterologous peptide
ligand sequences without affecting the functionally important tertiary structure of the Ad
fiber knob domain, while changing the binding affinity and specificity of the vector (Figs.
6, 7). This G-H loop region is exposed on the central part of the knob surface and may be
strategically a better site for incorporation of heterologous ligands than the peripheral H-I
loop (Krasnykh, V. et al., 1998, J. Virol., 72:1844-52) or the knob C-terminus (Michael,
S. I., et al., 1995, Gene Ther., 2:660-8; Wickham, T. J. et al., 1996, Nat. Biotechnol.,
14:1570-3), which are the substitution sites used by others. Therefore, these G-H loop
modifications within the fiber knob domain will allow the Ad vector to be redirected to
infect a desired cell type, as long as the G-H loop ligand sequence binds to at least one
surface protein on the desired cell type. Fig. 7 shows some possible substitutions.
Example II J demonstrates that the virion tolerates the insertion of a cyclic peptide (12
amino acids) with a constrained secondary structure that allows its exposure on the knob
surface. A defined ligand (RGD) can be inserted into the G-H and the H-I loop of an
Ad5 capsid that is ablated for CAR and integrin tropism. Infectivity studies show the
potential advantage of this new insertion site.
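
At the sequence level, replacing a loop with a heterologous ligand amounts to swapping a
defined segment while leaving the flanking residues unchanged. The Python sketch below is
a minimal illustration under stated assumptions: the function name substitute_loop, the
placeholder knob sequence and the loop coordinates are all hypothetical and do not
correspond to the actual Ad5 knob sequence or G-H loop positions. Only the RGD ligand is
taken from the text.

```python
def substitute_loop(knob: str, loop_start: int, loop_end: int, ligand: str) -> str:
    """Replace knob[loop_start:loop_end] with ligand, keeping the flanks intact.

    Coordinates are 0-based and half-open, as in Python slicing. They are
    hypothetical inputs supplied by the caller, not real G-H loop positions.
    """
    if not 0 <= loop_start <= loop_end <= len(knob):
        raise ValueError("loop coordinates fall outside the knob sequence")
    return knob[:loop_start] + ligand + knob[loop_end:]


# Placeholder sequence: 'A' and 'C' stand for flanking residues, 'G' for the loop.
placeholder_knob = "AAAAGGGGCCCC"
retargeted = substitute_loop(placeholder_knob, 4, 8, "RGD")
```

Because the ligand may be shorter or longer than the segment it replaces, the length of
the resulting sequence can change while both flanks are preserved.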

Use of the Vectors of the Invention for "Gene Therapy" 

The liver is the major organ for protein synthesis. Therefore, an important goal of gene
therapy is to target gene therapy vectors to the liver. To genetically correct many types of
mutant proteins, hepatocytes need to be infected with gene therapy vectors carrying a
corrected transgene. Example II J describes a G-H loop substitution in the knob domain
of the fiber protein with both RI and RII+ (of the malaria circumsporozoite surface
protein) in a short-shafted fiber protein, which gives the vector affinity and
specificity for hepatocytes.

Example II K applies a similar protocol to alter the fiber knob domain in the G-H loop
region with peptides that target the vector to breast cancer cell lines (MDA-MB-435). These
novel approaches to redirecting vectors described in the invention allow lower doses of the
gene therapy vectors to be administered with a higher safety profile.

In Example II L a protocol for preparing an adenovirus display library is described that
uses the fiber knob protein to display a library of random peptide sequences within the
G-H loop. This library of adenoviruses with modified fiber proteins is screened for affinity
and specificity for a desired cell type. There are two main advantages of using this
adenovirus display library, rather than a phage display library system, to screen for
target peptides that allow binding to a desired cell type. First, once a ligand peptide is
identified that binds to the desired cell type, it is already in the vector of choice for
gene therapy delivery. The peptide does not need to be engineered into another vector, as
is the case for the phage display library vectors. This reduces the steps required to
identify a targeted fiber protein for a desired cell type. The second advantage of this
method is that the adenoviruses are able to display multiple copies of the modified fiber
protein on their capsid. This allows for dimerization and trimerization of the fiber
protein with the host cell receptor. The multimerization of fiber proteins reflects the
realistic in vivo interaction of the trimeric fiber protein with the host cell receptor.
In contrast, phage vectors can only display one fiber peptide sequence on their surface,
which significantly limits their ability to interact with host cell surface receptors.

C. A chimeric adenovirus vector with selective tropism:

The chimeric vectors of the invention combine two vectors: an Ad.ITR and an Ad.fx,
where fx describes a modified fiber protein. A first generation adenovirus vector of
serotype 5 is the base vector that carries a transgene cassette flanked by heterologous
ITRs. These specific inverted terminal repeat sequences, such as AAV ITRs, direct stable
integration of the transgene cassette into the host genome as well as control predictable
genomic rearrangements that occur during viral replication. This vector can also carry a
modified fiber gene (described in Example II). During replication, predictable genomic
rearrangements occur which result in the generation of a gutless adenovirus vector (e.g.,
AAd.AAVfx) which expresses the modified fiber protein on its capsid. The modified fiber
protein allows the gutless vector to be targeted to a selected cell type. The targeted
vector is a gutless adenovirus vector devoid of adenoviral genes which can integrate its
transgene into the host genome. The transgene cassette can carry reporter, selectable, or
therapeutic genes.



In one embodiment of the invention, the gutless targeted ΔAd.AAV carries the reporter
gene β-galactosidase (ΔAd.AAVfx-BG). For easy in vitro selection of human and
bacterial cells that are transduced with the hybrid Ad vector, a bacterial origin of
replication can be added to the hybrid Ad vectors. An example is
ΔAd.AAVfx-SNori, in which an SNori sequence is added to the transgene cassette. This
sequence allows G418 selection of cells infected with ΔAd.AAVfx-SNori. This in vitro
selection provides a tool to analyze the site of transgene integration and the flanking
chromosomal regions. Fluorescent in situ hybridization (FISH) is an alternative method
to confirm vector integration.



An advantage of the ΔAd chimeric vector for gene transfer is the efficient and stable
integration of a large transgene cassette of up to, e.g., 22 kb, which is significantly larger
than the capacity of retroviral vectors. This is of particular interest for gene therapy. For
example, to ameliorate sickle cell anemia, ΔAd.AAVfx-γ-globin, a targeted, integrating
vector carrying a gamma-globin expression cassette, can be introduced into bone
marrow stem cells for long-term expression of the gamma-globin gene.

To achieve site-specific gene integration, rep78 protein is used for transgene integration
into the AAVS1 site (described in Example I D). However, integration at this site may
silence transgene expression. To prevent the integrated transgene from being silenced by
host genomic elements (such as positional effects or downstream enhancers), LCRs or
insulator elements are incorporated into the transgene cassette.

The following examples are presented to illustrate the present invention and to assist one of
ordinary skill in making and using the same. The examples are not intended in any way to
otherwise limit the scope of the invention.

EXAMPLE I 

NOVEL ADENOVIRAL VECTOR Ad.AAV

A. Integrating ΔAd.AAV hybrid vectors devoid of all adenoviral genes.

In vitro and in vivo studies with rAAV indicate that the only requirements for rAAV
integration are the AAV ITRs and as yet unknown host cellular factors. It is thought that
specific sequences or secondary structures present in AAV ITRs are prone to integration
into host chromosomal DNA. To combine the advantages of adenoviral vectors (high
titer, high infectivity, large capacity) with the integration capability of AAV ITRs, AAV
vector DNA, in which AAV ITRs flank a secreted human placental alkaline
phosphatase (SEAP)-neomycin phosphotransferase (neo) reporter gene cassette
(Alexander, I. E. et al. 1996, Human Gene Therapy 7:841-850), is incorporated into the E1
region of E1/E3 deleted adenoviral vectors (Ad.AAV1) (Figure 1, top).

METHODS 

Production/Characterization of Viral Vectors
Plasmids:

The AAV1 vector cassette containing AAV ITRs and SEAP/neo expression units is
obtained by AseI/ScaI digestion of the plasmid pALSAPSN (Alexander, I. E. et al. 1996,
Human Gene Therapy 7:841-50). The 4.4 kb AAV vector fragment is cloned via NotI
adapter linkers into pXCJL1 (Microbix, Toronto, Canada) to give pAd.AAV1. Another
shuttle vector (pAd.AAV1-Δ2ITRs) lacking the AAV ITRs is generated by inserting the 3.7 kb
AflII/BsmI fragment of pALSAPSN into pXCJL1. For pAd.AAV1-Δ1ITR, a construct is
used in which a spontaneous deletion in the left AAV ITR between the A and A' regions
has occurred. To create a second hybrid vector (Ad.AAV2), the AAVSNori cassette
developed by E. Rutledge is used. AAV vector DNA is obtained from pASNori
(Rutledge, E. A., Russell, D. W. 1997. Journal of Virology 71:8429-8436) as a 3.4 kb
BsaI/ScaI fragment and inserted into the EcoRV site of pXCJL1. As is generally
known for AAV vector plasmids, the AAV ITRs are prone to rearrangements. To
minimize deletions in these functionally critical regions, all constructs for generation of
hybrid vectors are assembled in low copy-number plasmids, which are grown in E. coli
Top10, JC811, or XL1 Blue cells (Stratagene, La Jolla, Calif.). Furthermore, after each
cloning step or large-scale plasmid amplification, both AAV ITRs are carefully mapped
by restriction analysis with enzymes that cut inside or adjacent to the ITRs (BssHII,
AhdI, SmaI, BglI, BsmI, AflII, and ScaI).

Adenoviruses:

First-generation viruses with the different transgene cassettes incorporated into the E1
region are generated by recombination of the pΔE1sp1A- or pXCJL1-derived shuttle
plasmids and pJM17 (Microbix) in 293 cells as described earlier (Lieber, A., et al., 1996,
J. of Virology, 70, 8782-8791). For each virus, at least 20 plaques are picked, amplified,
and analyzed by restriction digest. Viruses containing two AAV ITRs tend to rearrange
within the ITRs, with other adenoviral sequences, or with adenoviral sequences present in
the 293 cell genome. Only plaques from viruses with intact ITRs are amplified, CsCl
banded, and titered as described earlier (Kay, M. A., et al. 1995. Hepatology 21:815-819;
Lieber, A., et al. 1996. Journal of Virology 70:8944-8960). All virus preparations tested
are negative for RCA and bacterial endotoxin (Lieber, A., et al. 1997. Journal of Virology
71:8798-8807). Virus is stored at -80°C in 10 mM Tris-Cl, pH 7.5, 1 mM MgCl2, 10%
glycerol.

To generate ΔAd.AAV, 293 cells are infected with Ad.AAV1 at a multiplicity of
infection (MOI) of 25 and harvested 40 h after infection. Cells are lysed in PBS by 4
cycles of freeze/thawing. Lysates are centrifuged to remove cell debris and digested for
30 min at 37°C with 500 units/ml DNaseI and 200 µg/ml RNaseA in the presence of
10 mM MgCl2. 5 ml of lysate is layered on a CsCl step gradient (0.5 ml of 1.5 g/cm3, 2.5 ml
of 1.35 g/cm3, 4 ml of 1.25 g/cm3) and ultracentrifuged for 2 h at 35,000 rpm (rotor SW41).
CsCl fractions are collected by puncturing the tube and are analyzed for viral DNA
(Lieber, A., et al. 1996. Journal of Virology 70:8944-8960; Steinwaerder, D. S., et al.
1999. J Virol 73:9303-13) or subjected to ultracentrifugation at 35,000 rpm for 18 hours
in an equilibrium gradient with 1.32 g/cm3 CsCl. The band containing the deleted viruses
(ΔAd.AAV) is clearly separated (0.5 cm distance) from other banded viral particles
containing full-length Ad.AAV genomes. ΔAd.AAV1 fractions are dialyzed against
10 mM Tris-Cl, pH 7.5, 1 mM MgCl2, 10% glycerol and stored at -80°C. The genome titer
of ΔAd.AAV1 preparations is determined by quantitative Southern analysis of viral
DNA purified from viral particles in comparison to different concentrations of a 4.4 kb
AseI/ScaI fragment of pALSAPSN according to a protocol described earlier (Lieber, A.,
et al., 1996, J. of Virology, 70, 8782-8791). In total, the production of 1x10^13 genome
particles of ΔAd.AAV1 requires less than 3 hours of actual work.

Titers routinely obtained are in the range of 3-8x10^12 genomes per ml. Assuming one
genome is packaged per capsid, the genome titer equals the particle titer. The level of
contaminating Ad.AAV1 is less than 0.1% as determined by Southern analysis, which is
consistent with results obtained by plaque assay on 293 cells (fewer than 5 plaques per
10^6 total genomes). The primers used for sequencing the left and right ITR-vector
junctions are



5'GGCGTTACTTAAGCTAGAGCTTATCTG, and 
5XTCTCTAGTTCTAGCCTCGATCTCAC. 

The recombinant AAV virus stock containing the SEAP/neo cassette (AV2/ALSAPSN;
Alexander, I. E. et al., 1996, Human Gene Therapy, 7:841-50) used in these studies was
obtained from Dusty Miller (FHCRC, Seattle). The stock was free of contaminating
replication competent AAV (<50 particles/ml) and wildtype adenovirus (<100
particles/ml). The genome titer of the virus stock was obtained by quantitative Southern
blots as described by Russell et al. (Russell, D. et al. 1994 Proc. Natl. Acad. Sci. USA
91:8915-8919).

Electron Microscopy: 

For examination of viral particles by transmission electron microscopy, CsCl-purified
virions are fixed with glutaraldehyde and stained with uranyl acetate as described
previously (Lieber, A., et al. 1996. Journal of Virology 70:8944-8960).



During replication of these hybrid vectors in 293 cells, a 5.5 kb genome (ΔAd.AAV1) is
efficiently generated and packaged into adenovirus (Ad5) capsids. The ΔAd.AAV1
genome contains the left adenovirus ITR and the packaging signal, followed by the AAV
vector cassette and a duplicate of the adenoviral packaging signal and ITR in reverse
orientation (Figure 1, bottom). The hybrid vector is devoid of all viral genes, thus
eliminating toxic effects and the elicitation of cellular immune responses. The
spontaneous formation of the small hybrid vector genome ΔAd.AAV1 requires the
presence of two intact AAV ITRs and does not occur with partly deleted ITRs or with
oligo-dC and oligo-dG stretches flanking the expression cassette.

Hybrid vectors containing different transgenes: 

To construct a hybrid vector with a transgene that can be detected in situ, the SEAP/neo
expression unit in Ad.AAV1 is replaced by the E. coli β-galactosidase gene. This hybrid
vector is named ΔAd.AAVBG. During generation of the corresponding plasmid constructs,
the AAV ITR sequences tend to rearrange and lose their functional properties. This
problem can be circumvented by using low copy number plasmids as cloning vectors,
grown in bacterial strains depleted of all recombination proteins (e.g. JC811).
Furthermore, the integrity of both AAV ITRs after each cloning step can be examined
by characteristic endonuclease digestion. Recently, another hybrid vector,
ΔAd.AAVSNori, has been generated, which contains the neo gene under the control of
both the simian virus 40 (SV40) early promoter and the transposon 5 (Tn5) promoter for
expression in human and bacterial cells, as well as the p15A bacterial replication origin
with the direction of leading strand DNA synthesis opposite that of neo gene
transcription. Thus, SNori can be used for G418 selection of integrated vector in
eukaryotic cells as well as for rescue of vector together with flanking host DNA after
integration. The recovered plasmids can be propagated in E. coli under selection with
kanamycin due to the bacterial origin and the neo gene. SNori containing vectors allow a
rapid estimation of total integration events based on the number of G418 resistant
colonies. Moreover, vector DNA together with flanking chromosomal DNA can be
rescued as plasmids from single G418 resistant clones and can be used for sequencing to
determine integration junctions. Both hybrid vectors are produced at a titer of about
3x10^12 genomes per ml. The ratio of genome titer to transducing particles for
ΔAd.AAVBG is ~200:1 based on β-Gal expression.

DISCUSSION 



ΔAd.AAV1 could form spontaneously during adenovirus replication. Another possible
mechanism of ΔAd.AAV1 formation is based on the unique mechanism by which
adenovirus replicates its genome (van der Vliet, B., 1995, In W. Doerfler, et al. (eds.) vol.
2 p. 1-31, Springer-Verlag, Berlin) (see Figure 1). Ad DNA replication is initiated by the
TP/pTP (terminal protein), which binds to specific sites within the ITRs on both ends of the
linear genome and catalyzes, in complex with Ad pol, the binding of the 5' CTP, the first
nucleotide of the daughter strand. DNA synthesis proceeds in a continuous fashion to the
other end of the genome (Figure 1A). Only one of the DNA strands serves as template.
One of the replication products is a single-stranded DNA that circularizes through
annealing of its self-complementary ITRs. The resulting duplex "panhandle" has the
same structure as the termini of the duplex viral genome, which allows the binding of pTP
and the initiation of synthesis of a complementary strand using the single-stranded
"panhandle" molecule as template (Figure 1C). In the case of Ad.AAV1, the Ad pol
synthesizes the single strand of the adenoviral genome starting from the left Ad ITR until
it reaches the second AAV ITR. During synthesis of the second AAV ITR, a certain
percentage of the single-stranded molecules form a loop that hybridizes to the
complementary region within the first AAV ITR that was replicated earlier, allowing Ad
pol to use the same viral DNA strand to read back towards the left ITR (Figure 1B). The
resulting "panhandle" structure can be resolved in a similar way as the full-length
intermediate shown in Figure 1C, generating a double-stranded, linear molecule with the
above-described structure that can be packaged into Ad virions. The ratio of viral DNA
to protein concentration in purified ΔAd.AAV1 particles is comparable to that obtained
from Ad.AAV1 particles. This indicates that, despite the smaller size, only one
ΔAd.AAV1 genome is packaged, resulting in particles with a lighter buoyant density
(~1.32 g/cm3). Electron microscopy demonstrates the icosahedral shape of ΔAd.AAV1
particles (Figure 2). Staining with uranyl acetate causes the central viral cores to appear
electron dense. ΔAd.AAV1 virions have only a spotted luminal dark staining, as expected
with only one 5.5 kb genome being packaged per capsid.

B. In vitro ΔAd.AAV1 production:

Characteristics of deleted adeno-AAV vectors (ΔAd.AAV):

A number of experiments are carried out to clarify the mechanisms of ΔAd.AAV genome
formation. Specifically, the presence of two intact AAV ITRs flanking a reporter gene
cassette is required for the effective formation of ΔAd.AAV genomes. This process does
not occur with partially deleted ITRs or with oligo-dC and oligo-dG stretches flanking the
expression cassette. Furthermore, in vitro transduction studies are performed with
different genome titers of ΔAd.AAV1, Ad.AAV1, and Ad.AAV1-Δ2ITRs (lacking the
two AAV ITRs) to determine the number of G418 resistant colonies that form after
4 weeks of selection (Table I).

ΔAd.AAV1 is routinely produced at a high titer (5x10^12 genomes per ml with >10^4
produced genomes per 293 cell) and at a high purity, with less than 0.1% contaminating
full-length Ad.AAV1 genomes, by a technique normally used for amplification and
purification of recombinant adenovirus.

In vitro transduction studies with hybrid vectors on CD34+ cells and erythroleukemia
cells:

To test whether the hybrid vectors allow gene transfer into cell types that
have to be targeted for sickle cell therapy, infection/transduction studies are performed
using CD34+ enriched human bone marrow cells derived from mobilized peripheral
blood and the human erythroleukemia cell line K562, which expresses ε- and γ-globin genes.

Cell Culture:

SKHep1 cells (HTB-52, American Type Culture Collection, Rockville, MD), an
endothelial cell line derived from human liver (Heffelfinger, S. C., et al., 1992, In Vitro
Cell Dev. Biol. 28A, 136-142), are grown in high-glucose Dulbecco's modified Eagle
medium with 10% fetal calf serum. SKHep1 cells are analyzed for integrated AAV
provirus by Southern analysis of genomic DNA using the AAV1 wild type genome
obtained from pAAV/Ad (Samulski, R. J., et al. 1989. Journal of Virology 63:3822-3828)
(gift from David Russell, University of Washington) as a probe. No specific bands are
detected in undigested genomic SKHep1 DNA or after digestion with HindIII. For viral
infection, confluent cells are incubated with different viral doses for 2 hours, followed by
intensive washing. For G418 selection, 24 h after infection with ΔAd.AAV1, SKHep1
cells are trypsinized and plated at different dilutions under G418 selection (900 µg/ml
active compound, Boehringer-Mannheim, Germany). G418 containing culture medium is
changed every 3 days. The number of colonies with >10^4 cells is counted after 4 weeks
of selection and divided by the number of initially seeded cells. This ratio is used to
express the integration frequency of ΔAd.AAV1. Single colonies are obtained by limiting
dilutions of infected cells in 96 well plates. Colonies are expanded to 1x10^6 cells in the
presence of G418. Immunofluorescence analysis for adenoviral proteins expressed in
SKHep1 cells 3 days post-infection is performed as described earlier (Lieber, A., et al.,
1996, J. of Virology, 70, 8782-8791).
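
The integration frequency defined above is a simple ratio of counted colonies to seeded cells. As a minimal sketch (the function name and the example counts are hypothetical illustrations, not data from these studies), it could be computed as follows:

```python
def integration_frequency_percent(colonies: int, seeded_cells: int) -> float:
    """Return G418 resistant colonies as a percentage of initially seeded cells.

    Hypothetical helper: mirrors the ratio described in the text
    (colonies counted after selection / cells initially seeded).
    """
    if seeded_cells <= 0:
        raise ValueError("seeded_cells must be positive")
    if colonies < 0:
        raise ValueError("colonies must be non-negative")
    return 100.0 * colonies / seeded_cells


# Hypothetical plate: 27 colonies counted from 1,000 seeded cells.
print(integration_frequency_percent(27, 1000))
```

Expressing the result as a percentage matches the units used in Table I.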

RESULTS 

293 cells are infected with the first generation vector Ad.AAV1. During replication of
Ad.AAV1, the small ΔAd.AAV1 genome forms spontaneously and is packaged into
adenovirus capsids. At 36 hours after infection, cells are harvested and virus is released by
several cycles of freeze/thawing. The mixture of Ad.AAV1 and ΔAd.AAV1 particles in
the cell lysate is then separated by ultracentrifugation in a CsCl step gradient. Due to its
lighter buoyant density, the band containing the ΔAd.AAV1 particles is clearly separated
(0.8 cm distance) from the band containing full-length virus (Lieber, A., et al. 1999. J
Virol 73:9314-24). ΔAd.AAV1 is purified further by an additional CsCl equilibrium
gradient and is stored in 10 mM Tris pH 7.5, 10% glycerol, 1 mM MgCl2 at -80°C. In total,
the production of 2x10^13 (genome) particles of ΔAd.AAV1 requires less than 3 hours of
work. All functions for ΔAd.AAV1 replication and particle formation are provided by
Ad.AAV1 genomes amplified in the same cell. The efficiency of vector production
measured on a genome-per-cell basis is comparable to or higher than that of labor-intensive,
newer techniques for rAAV production, which have not yet been proven to be reliable. The
estimated ratio of transducing to genome titer for ΔAd.AAV1 is 1:200 (based on SEAP
expression at day 3 post-infection), whereas for the average rAAV preparation it is in the
range of 1:10^3 to 1:10^4.

1x10^5 confluent SKHep1 cells are infected with different MOIs
of rAAV1 (stock: 1x10^10 genomes per ml), ΔAd.AAV1 (stock: 5x10^12 genomes per ml),
Ad.AAV1 (stock: 1x10^13 genomes per ml), and Ad.AAV1-Δ2ITRs (stock: 9x10^12 genomes
per ml) in a volume of 100 µl. 24 hours after infection, cells are washed, trypsinized, and
plated at different dilutions. G418 is added 24 hours after plating and selection is
performed for 4 weeks. G418 resistant colonies contain on average >5x10^4 cells (at least
16 cell divisions). A significant number of small colonies visible at 2 weeks post-infection
do not survive continued selection, probably due to episomal vector expression.
Cells infected with first-generation adenoviruses at MOIs greater than 1x10^4 develop
CPE during the first week of selection. The rAAV titer is not high enough to perform
infection studies with MOIs greater than 10^4. Colony formation is expressed as the
percentage of the number of colonies after selection relative to the number of cells initially
seeded for selection (Table I).

TABLE I

Formation of G418 resistant colonies after infection with hybrid viruses in
comparison with rAAV.

MOI (genomes   Formation of G418 resistant colonies in % (SEM)
per cell)      (after 4 weeks of selection)

               rAAV1        ΔAd.AAV1     Ad.AAV1      Ad.AAV1-Δ2ITRs
10^1           0            0            0            0
10^2           0            0            0            0
10^3           2.7 (1.6)    1.3 (1)      5.4 (3.0)    0
10^4           90.8 (7.0)   48.0 (8.9)   12.9 (7.2)   0
10^5           N/A          93.1 (5.4)   3.8 (2.1)    0
10^6           N/A          100          0            0
10^7           N/A          100          0            0

N = 3 (SEM is indicated in parentheses.)



K562 cells are infected with different MOIs of ΔAd.AAVBG (1-10^8 genomes per cell):

Three days after infection, the total number of viable cells (based on Trypan blue
staining) and the percentage of infected cells (based on X-Gal staining) are determined
for all MOIs. The results are presented in Figure 4A.

Initial integration studies: 

K562 cells are incubated with ΔAd.AAVSNori at an MOI of 2x10^5 genomes per cell, and
the colonies that form after 4 weeks of G418 selection are counted in 96 well plates.
G418 resistant colonies contain on average >5x10^4 cells, which means that the original
cell underwent at least 16 cell divisions.
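
The figure of at least 16 cell divisions follows from the colony size: a single cell that doubles n times yields 2^n cells, so a colony of more than 5x10^4 cells needs the smallest n with 2^n >= 5x10^4. A minimal sketch of that arithmetic, assuming every division is a clean doubling with no cell loss (the function name is hypothetical):

```python
import math


def min_divisions_for_colony(cells_in_colony: float) -> int:
    """Smallest number of doublings that takes one cell to the given colony size.

    Assumes synchronous, loss-free doubling, which is an idealization.
    """
    if cells_in_colony < 1:
        raise ValueError("a colony contains at least one cell")
    return math.ceil(math.log2(cells_in_colony))


# 2**15 = 32,768 is below 5x10^4 and 2**16 = 65,536 is above it.
print(min_divisions_for_colony(5e4))
```

Because real colonies lose cells during selection, the true number of divisions can only be higher than this lower bound.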


Infection studies with ΔAd.AAVBG (1-10^8 genomes per cell) on CD34+ cells:

Infection of CD34+ cells is performed as described for K562 cells. CD34+ cells are cultured
in IMDM supplemented with 20% FCS, kit ligand (stem cell factor, SCF) (100 ng/ml), and
IL-3 (100 ng/ml). Since a number of reports suggest that specific cytokines such as GM-CSF
or M-CSF, which induce stem cell differentiation, can stimulate integrin expression and
may therefore affect internalization of Ad5 vectors, infection rates with
Ad5 based hybrid vectors are compared on CD34+ cells cultured with and without
pre-stimulation with GM-CSF (50 ng/ml) or M-CSF (50 U/ml). The number of infected cells
is counted based on X-Gal staining at day 3 after infection. To test for dose-dependent
toxicity, viable cells are counted (based on trypan blue exclusion) at day 3 post-infection.
Furthermore, it is analyzed whether high viral doses affect the ability of CD34+ cells to
differentiate in methyl cellulose colony assays in the presence of IL-3 and SCF. The results
are expressed as viable cells/X-Gal positive cells vs MOI (see Figure 4B).



The above data demonstrate that ΔAd.AAV stably transduces an immortalized human
cell line with a low frequency comparable to that of rAAV. However, transduction rates
can be scaled up to 100% by using greater MOIs of ΔAd.AAV1, which is produced at higher
titers than rAAV1. In contrast to infection with the first-generation vector Ad.AAV1,
infection with ΔAd.AAV1 is not associated with dose-dependent cytotoxicity because no
viral proteins are expressed from these vectors in transduced cells. Furthermore, viral
proteins present in the incoming ΔAd.AAV1 particles are not problematic in the dose
range used. The comparison of transduction rates of ΔAd.AAV1/Ad.AAV1 with the vector
lacking AAV ITRs, Ad.AAV1-Δ2ITRs, supports the hypothesis that the presence of two
intact AAV ITRs is crucial for hybrid vector integration.

The data demonstrate that the leukemia cell line can be infected at ~90% efficiency with
Ad5 based hybrid vectors at an MOI of 2x10^5 genomes per cell without significant toxic
side effects. However, this dose is still ~100 times greater than the dose necessary to infect
100% of HeLa cells, hepatoma cells, primary hepatocytes and other cell lines generally
considered permissive for Ad5 vector infection.

Since viral DNA in cells infected with 2x10^4 genomes (or 100 transducing particles per
cell) should be lost after 7 cell divisions, the presence of G418 resistant cells in the
observed colonies suggests that ΔAd.AAVSNori genomes are integrated into or stably
associated with the host genome. Based on the number of G418 resistant colonies, one
out of 25,000 ΔAd.AAVSNori genomes integrates stably into K562 cells. This is in
agreement with the results obtained earlier with ΔAd.AAV1 in SKHep1 cells.
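
The dilution argument above can be checked with simple arithmetic. The sketch below assumes the ~200:1 ratio of genomes to transducing particles reported for these vectors, and further assumes that non-integrated genomes do not replicate and are halved at each cell division; both function names are hypothetical.

```python
def transducing_particles(genomes_per_cell: float,
                          genomes_per_transducing_particle: float = 200.0) -> float:
    """Convert a genome dose into transducing particles (assumed 200:1 ratio)."""
    return genomes_per_cell / genomes_per_transducing_particle


def divisions_until_diluted(copies_per_cell: float) -> int:
    """Count divisions until the average episomal copy number drops below one.

    Assumes non-replicating episomes that are split evenly between daughter cells.
    """
    divisions = 0
    while copies_per_cell >= 1.0:
        copies_per_cell /= 2.0
        divisions += 1
    return divisions


particles = transducing_particles(2e4)  # 2x10^4 genomes per cell
print(particles, divisions_until_diluted(particles))
```

Under these assumptions, episomal copies fall below one per cell well before the 16 or more divisions needed to form a G418 resistant colony, which is why surviving colonies point to integrated or stably associated genomes.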

The maximal dose used for infection of CD34+ cells (1x10^8) results in X-Gal staining of
only ~10% of cells, independently of GM-CSF/M-CSF. This demonstrates the obvious
inability of Ad5 to infect CD34+ cells and is probably caused by the absence of specific
receptors and/or integrins on the cell surface. CD34+ cells tolerate a large range of viral
doses (1-10^7) without obvious effects on cell viability and total cell number. This is not
surprising because, in order to develop toxic side effects, adenovirus has to enter the cell
and express viral genes. Hybrid vectors can be produced at titers of 5x10^12 genomes per
ml. Thus, the maximal MOI that can be used for infection (of 10^4 cells) is ~5x10^8 (in
100 µl storage buffer). Based on the infection studies with ΔAd.AAVBG, this dose may
not be sufficient to efficiently transduce CD34+ cells and to obtain an appreciable
number of G418 resistant colonies.

C. In Vivo Properties Of ΔAd.AAV1:

Viral DNA is labeled with BrdU during virus amplification to investigate cellular/nuclear
vector uptake in situ. For transduction studies, confluent SKHep1 cells (a human
endothelial cell line) are infected with 2000 genomes of ΔAd.AAV1 or Ad.AAV1 per cell.
BrdU tagged viral DNA is detected in 100% of nuclei at 3 hours post-infection for both
viruses, indicating efficient cellular and nuclear uptake of hybrid virus DNA.

RESULTS 

The ΔAd.AAV1 vector transduces cells in vitro, forming G418 resistant colonies with an
efficiency of 17% or 58% after infection at an MOI of 1x10^3 or 1x10^4 genomes per cell,
respectively. Approximately 2x10^4 ΔAd.AAV1 genomes are required to yield one stable
transfectant. Since all stable colonies contain integrated ΔAd.AAV1 vector DNA, this
number reflects the minimal integration frequency of ΔAd.AAV1 in SKHep1 cells, which
is comparable with that of rAAV (Rutledge, E. A. et al., 1997, Journal of Virology,
71:8429-36). The number of G418 resistant colonies does not necessarily represent the
total frequency of integration events because not all integrated copies express neomycin
phosphotransferase, due to chromosomal position effects or incomplete integration.
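
One way to relate the number of genomes needed per stable transfectant to the colony data is to divide the MOI by the fraction of seeded cells that form colonies. The sketch below is only an illustration of that estimate: it assumes each colony arises from one stably transduced cell, and the function name is hypothetical.

```python
def genomes_per_stable_colony(moi_genomes_per_cell: float,
                              colony_percent: float) -> float:
    """Estimate input genomes per stable colony from MOI and colony formation (%)."""
    if not 0.0 < colony_percent <= 100.0:
        raise ValueError("colony_percent must be in the range (0, 100]")
    return moi_genomes_per_cell / (colony_percent / 100.0)


# MOI and colony efficiencies quoted in the paragraph above.
for moi, percent in [(1e3, 17.0), (1e4, 58.0)]:
    print(f"MOI {moi:.0e}: ~{genomes_per_stable_colony(moi, percent):.0f} genomes per colony")
```

At an MOI of 1x10^4 and 58% colony formation this gives roughly 1.7x10^4 genomes per colony, the same order of magnitude as the approximate figure of 2x10^4 stated in the text.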

The absence of adenoviral gene products in ΔAd.AAV1 transduced cells at day 3
post-infection is demonstrated by immunofluorescence with antibodies to the major late
proteins (hexon, fiber) and early proteins (DBP, E4-orf6). Expressed adenoviral proteins
are detected only in cells infected with Ad.AAV1. The fact that cells infected with
ΔAd.AAV1 do not express potentially cytotoxic adenoviral proteins is important.

While an MOI of 1x10^4 genomes per cell of the first generation vector Ad.AAV1 induces
cytopathic effects in SKHep1 cells at day 3 p.i., no toxic side effects are observed when
SKHep1 cells are infected with ΔAd.AAV1 at a dose of up to 1x10^8 genomes per cell.
Since the transduction efficiency is clearly dose dependent, ΔAd.AAV1 (which can be
produced at titers of >5x10^12 genomes/ml) is able to stably transduce Ad5 permissive cell
lines or tissues with 100% efficiency without associated toxicity.

Southern analysis indicates that ΔAd.AAV1 integrates randomly as head-to-tail tandem
repeats into the host cell genome via the right AAV ITR, whereas the other junction with
the chromosomal DNA is variable and occurs somewhere within the transgene cassette.
To confirm the integrated status of ΔAd.AAV1 DNA, high-molecular-weight
chromosomal DNA is separated by pulsed-field gel electrophoresis (PFGE), followed by
Southern analysis with a SEAP specific probe (Figure 3). Undigested DNA from control
SKHep1 cells gives an endogenous SEAP signal that co-migrates with chromosomal DNA
just below the well (lanes 1 and 5). No high-molecular-weight episomal forms of
ΔAd.AAV1 DNA are detected, whereas a distinct 35 kb band is visible in DNA from
SKHep1 cells isolated 3 days after infection with the first generation adenovirus Ad.AAV1
(lanes 4 and 13). Digestion with EcoRI reveals the 4.4 kb fragment, which is specific for
integrated tandem copies of the AAV cassette (lanes 8 and 12). To eliminate the
possibility that chromosomal DNA is trapped in the well, DNA samples are digested with
the intron-encoded endonucleases PI-SceI or I-CeuI (Gibco-BRL, Grand Island, NY), which
have a sequence specificity of more than 11 bp or 9 bp, respectively. Digestion with PI-SceI
yields a >2 Mb endogenous SEAP signal in SKHep1 cells (lane 2) and an additional signal
in the range of ~1 Mb in G418 resistant colonies transduced with ΔAd.AAV1 (lane 7).
I-CeuI digestion results in a smear between 250 and 1000 kb in ΔAd.AAV1 transduced
SKHep1 cells (lanes 10, 11), indicating random integration, whereas a high-molecular-weight
band specific for the endogenous SEAP gene is observed in control SKHep1 cells
(lane 9).

One day after intraportal infusion of 1x10^12 ΔAd.AAV1 genomes in C57Bl/6 mice, BrdU
labeled vector genomes can be detected in 85% of hepatocytes (Lieber, A., et al. 1999. J
Virol 73:9314-24). Hepatocellular DNA analysis performed at 2 months post-infusion
reveals ΔAd.AAV1 DNA integrated with an average of 0.5 copies per cell into the mouse
genome (Lieber, A., et al. 1999. J Virol 73:9314-24). To assess potential side effects of
intraportal ΔAd.AAV1 infusion, serum glutamic pyruvic transaminase (SGPT), a
sensitive marker for hepatocellular injury, is measured for 7 consecutive days
post-infusion in combination with histological analysis of liver sections. No significant
elevation in SGPT levels or histological abnormalities are detected after intraportal
infusion of 1x10^12 or 1x10^13 ΔAd.AAV1 genomes, whereas infusion of the same dose of
full-length Ad.AAV1 vector is associated with severe hepatotoxicity or fatal outcome. This
suggests that the dose of ΔAd.AAV1 administered to mice can be increased to obtain
higher transduction efficiencies in vivo without adverse side effects, which is not possible
for first generation adenoviruses. Importantly, ΔAd.AAV1 transduced quiescent
hepatocytes in vivo, which suggests that integration of hybrid vector DNA may not
require cell proliferation. Recently, more detailed in vivo transduction studies with
Ad.AAV1 and ΔAd.AAV1 have been performed in Balb/c mice to study whether the
absence of adenoviral gene expression in cells infected with ΔAd.AAV1 can avoid an
anti-viral immune response and can prolong vector persistence. In this mouse strain,
vector DNA is cleared from the liver at 4-6 weeks after infusion with first generation
adenoviruses, mostly due to a CTL response against viral proteins produced in transduced
cells. Vector DNA is analyzed by genomic Southern blot of hepatic DNA at 12 weeks
after infusion of 1x10^12 genomes of Ad.AAV1 or ΔAd.AAV1. At this time point, no vector
specific signal is detectable in hepatic DNA from mice infused with the first generation
vector Ad.AAV1, while ~0.3 copies of ΔAd.AAV1 genomes per cell are present in livers
of mice that received the hybrid vector, again indicating the superior in vivo properties of
the hybrid vector.

D. Effects of Rep Coexpression on ΔAd.AAV Integration

Rep expression after plasmid transfection:

In order to test whether Rep expression enhances site-specific integration of ΔAd.AAV1
in human cells, a series of Rep expression plasmids are constructed.

METHODS

The Rep ORF 68/78 (nt 285-2313) including the internal p19 and p40 promoters is
obtained from pAAV/Ad (Samulski, R. J. et al., 1991, In B. N. Fields, et al. (eds.),
Fields Virology, vol. 2 Lippincott-Raven Publisher, Philadelphia) by digestion with
BsaI/BsrI. This fragment, deleted for the AAV p5 promoter, is cloned via adapter linkers
under the RSV or PGK promoter in front of the bovine growth hormone polyadenylation
signal (bPA) into pAd.RSV or pAd.PGK (Lieber, A., and Kay, M.A., 1996, J. of
Virology, 70, 3153-3158; Lieber, A., et al., 1995, Human Gene Therapy, 6, 5-11),
respectively.

RESULTS

When the resulting plasmids (pRSVrep, pPGKrep) are transfected into 293 cells or SKHep1
cells, most of the Rep proteins expressed from the heterologous promoters (RSV or PGK)
are Rep 68 and Rep 78, while transfection of the rep gene under the p5 promoter
(pAAV/Ad) results in predominant Rep 52/40 expression. Thus, transfection of pRSVrep
and pPGKrep is more pronounced, suggesting a strong transactivation of AAV promoters
by E1a, which is produced in 293 cells. This result indicates that minimum expression of
rep proteins is necessary to avoid interference with adenovirus replication.

Rep-mediated site-specific integration of ΔAd.AAV1:

The potential for site-specific integration is an important characteristic of the novel
Ad.AAV vectors of the invention. In an embodiment of the invention, integration of the
ΔAd.AAV is directed by co-infection with Ad.AAV expressing the Rep 78 protein to
achieve site-specific integration in the AAVS1 site on human chromosome 19. For this
type of site-specific integration to occur in cells other than 293 cells, E4 ORF6
expression is required. The co-infection of ΔAd.AAV, ΔAd.rep78, and ΔAd.E4-orf6
allows for site-specific integration of the ΔAd.AAV transgene cassette. The ΔAd.rep78
and the ΔAd.E4-orf6 genomes are degraded soon after transduction, thus avoiding
potential side effects. Site-specific integration is preferred over random integration,
which is seen with rAAV and ΔAd.AAV, in order to reduce the risk of insertional
mutagenesis.

A preliminary test can be performed to confirm the functional activity of Rep 68/78
expressed from pRSVrep to mediate site-specific integration of ΔAd.AAV1 (Figures 5 and
6). Human SKHep1 cells are transfected with pRSVrep or control plasmid (pRSVbGal;
Lieber, A., et al., 1995, Human Gene Therapy, 6, 5-11) (transfection efficiency was
~20%), followed by infection with ΔAd.AAV (2000 genomes per cell). Three days after
infection, cells are trypsinized, embedded in agarose, lysed in situ, digested with I-CeuI
(an intron-encoded endonuclease with a recognition sequence of more than 10 nt),
subjected to pulsed field gel electrophoresis in 1% agarose gel, and analyzed by Southern
Blot. Hybridization with a probe covering the AAVS1 integration site (1.7kb
EcoRI/BamHI fragment from the chromosome 19 locus (Samulski, R. J. et al., 1991, In
B. N. Fields, et al. (eds.), Fields Virology, vol. 2 Lippincott-Raven Publisher,
Philadelphia)) reveals an AAVS1-specific band (~240kb) in I-CeuI digested DNA from
cells after control plasmid transfection (pCo) + ΔAd.AAV1 infection. An additional
signal in the range of 280kb appears in rep expressing cells infected with ΔAd.AAV1
(pRSVrep + ΔAd.AAV1), indicating a site-specific insertion into the AAVS1 site in a
certain percentage of cells. The presence of vector DNA in this 280kb band is confirmed
by rehybridization of the same filter with a transgene (SEAP) specific probe. Randomly
integrated ΔAd.AAV1 vector appears as a diffuse SEAP signal in the range 280-680kb
(pCo+ΔAd.AAV1, pRSVrep+ΔAd.AAV1). The specific ~1.9 Mb band on blots
hybridized with the SEAP probe represents an I-CeuI fragment containing the
endogenous human SEAP gene.

Incorporation of Rep 68/78 function into hybrid vectors to stimulate site-specific integration:

Rep overexpression inhibits adenovirus DNA replication, prohibiting the generation of
rep expressing Ad vectors using conventional strategies. To solve this problem,
significant Rep 68/78 expression from the hybrid vector in virus producer (293) cells
must be prevented while maintaining transient Rep expression in target cells (HSC) to
mediate site-specific integration. Our hypothesis is that the specific structure of the
ΔAd.AAV hybrid virus can be used to bring the rep gene 68/78 into a transcriptionally
active position under control of a HSC specific promoter only at late stages of virus
replication in 293 cells. This will allow amplification of the hybrid vector in 293 cells,
generating high titer virus which activates the incorporated Rep 68/78 functions only in
HSC. The general outline of our strategy to produce Rep expressing hybrid vectors is
illustrated in Figure 7. The rep/transgene cassette is assembled based on the left-hand
shuttle plasmid used for recombinant adenovirus production. The gene encoding Rep
68/78 is cloned in 3'→5' orientation in front of a transgene expression cassette flanked by
AAV ITRs. Between the transgene cassette and the right AAV ITR an HSC-specific
promoter is inserted with direction towards the adenoviral E2, E3, and E4 genes. The
recombinant genome is produced by recombination in E. coli, and transfection into 293
cells generates virus (Ad.AAV-rep). The specific structure of ΔAd.AAV with duplicated
sequences flanking the AAV ITRs is used to bring the rep gene into a transcriptionally
active position under control of a HSC specific promoter only during late stages of viral
DNA replication in 293 cells. During amplification of Ad.AAV-rep, the smaller genome
ΔAd.AAV-rep is formed and packaged into particles, which can be separated by
ultracentrifugation in CsCl gradients. The specific structure of ΔAd.AAV-rep brings the
rep gene into 5'→3' orientation in relation to the HSC specific promoter, allowing rep
transcription in target cells. After transduction of HSC with purified ΔAd.AAV-rep
particles, rep expression is activated and mediates rescue of the AAV-ITR/transgene
cassette from the adenoviral vector backbone and site-specific integration. The
hypothesis is that Rep-mediated integration into AAVS1 occurs via the right or both
AAV ITRs, causing the rep gene to become separated from the hepatocyte-specific
promoter once the vector is integrated (Figure 7). Therefore, rep expression should be
only transient without critical cytotoxic side effects on the host cell.

Promoters that can regulate rep expression:

Potential candidate promoters to drive rep expression with high specificity for HSC and
minimal activity in 293 cells are the 454nt CD34 promoter (Krause, D.S., et al., 1997,
Experimental Hematology, 25, 1051-1061; Yamaguchia, Y. et al., 1997, Biochimica et
Biophysica Acta., 1350:141-6), the 300nt HS 40 enhancer (Chen, H.L., et al., 1997,
Nucleic Acids Res. 25, 2917-2922) or a 3kb CD34 enhancer (May, G. et al., 1995, EMBO
J., 14:564-74) in combination with an initiator, or the HIV LTR. An optimal promoter is
selected based on studies of transient reporter gene expression after plasmid transfection
in 293 cells and hepatocytes. All promoters to be tested are cloned in front of the human
α1-antitrypsin (hAAT)-bovine growth hormone polyadenylation signal (bPA) into the
adenoviral shuttle plasmid pCD2 (pAd.-hAAT). Promoter activity can be tested in
transient plasmid transfection assays in CD34+ and 293 cells. The promoter with the
highest hAAT levels in CD34+ or K562 cells and the lowest hAAT expression in 293
cells is selected for further studies. If high background expression in 293 cells from these
promoters is seen, insulators to shield HSC-specific promoters from the E1a enhancer,
which is still present in Ad shuttle plasmids, can be utilized.

Rep genes:

The large Rep 68/78 proteins are sufficient to mediate rescue and site-specific
integration. Unregulated Rep 52 and Rep 40 expression from the AAV p19 promoter
located within the ORF of Rep 68 and 78 must be prevented because production of these
smaller Rep proteins in 293 cells will affect cell viability and adenoviral DNA synthesis.
To do this, constructs obtained from Surosky et al., containing a mutated Rep 52/40 start
codon to express Rep 68 and 78 individually under the CMV promoter, can be used. The 293
cells transiently expressing Rep 68 or Rep 78 from these constructs can be coinfected with
ΔAd.AAV1 (infection 24 hours after pCMVRep transfection, MOI 2x10^5 genomes/cell).
Three days after ΔAd.AAV1 infection, cellular DNA is analyzed for AAVS1-specific
integration events by PCR and PFGE as described earlier. Efficient Rep-mediated
excision of the AAV cassette and site-specific integration without flanking adenoviral
sequences are expected, and the plasmids pCMVRep68 or pCMVRep78 can be used as a
source of the corresponding rep genes for cloning into hybrid vectors.

Vectors: 

The rep/transgene cassette can be assembled based on pXCJL (Microbix, Toronto). A set
of control hybrid vectors can be generated with the AAV-ITR-transgene cassette only
without the rep gene. The recombinant Ad.AAV-rep genome can be generated by
recombination of the left hand shuttle plasmids with pCD1, a pBHG10 (Microbix,
Toronto) derivative, which contains the Ad5 genome deleted for the E1/E3 regions, in
recA+ E. coli (Chartier, C, et al., 1996, J. of Virology, 70, 4805-4810). Compared to the
standard technique based on plasmid recombination in 293 cells, this approach has the
advantage that plaques with recombinant virus appear 3 times faster and the production
of illegitimate recombinants is minimized. This allows efficient viral DNA amplification
and packaging to occur before Rep expression reaches levels that are potentially
inhibitory for adenoviral replication. The critical variables in maximizing the output of
the vector deleted for all adenoviral genes are the initial multiplicity of infection and
the time of harvesting. These parameters can be optimized for production of ΔAd.AAV-rep
hybrid vectors. A number of ΔAd.AAV vectors can be constructed incorporating the rep
gene. Cryptic promoter and enhancer elements present in the 5'-342nt of the adenoviral
genome can interfere with transgene expression from the heterologous promoters. This is
crucial for the strategy to avoid rep expression from ΔAd.AAV-rep genomes in 293 cells.
To ensure efficient transgene expression, insulator fragments such as the chicken beta-globin
insulator can be used with a selected promoter, constitutive or inducible.

Rep protein co-packaging: 



As an alternative to producing hybrid vectors containing the rep 68/78 gene, studies are
designed to see whether Rep protein can be co-packaged into ΔAd.AAV capsids and
whether these co-packaged Rep molecules are sufficient to mediate rescue and site-specific
integration of the AAV-ITR-transgene cassette. Our hypothesis is that Rep 68/78 binds
to the Rep binding site (RBS) present in the double-stranded ΔAd.AAV genome
and that this complex is co-packaged into adenoviral capsids, which are spacious enough
to accommodate extra proteins. Protein/DNA ratio analysis performed previously in
purified particles indicates that only one 5.5kb ΔAd.AAV1 genome is packaged per
capsid. This is confirmed by electron microscopy of ΔAd.AAV1 particles, which reveals
only spotted electron-dense staining associated with viral cores and extended free luminal
space (see Figure 2).

293 cells are transfected with plasmids expressing Rep 68/78 under the CMV promoter
and the kinetics of rep expression is determined by Western Blot with cell lysates
collected at different time points after transfection. Next, these 293 cells are infected
with Ad.AAV (MOI 1, 10, 100 pfu/cell) at specific time points after transfection of Rep
plasmids depending on the Rep expression kinetics (e.g. 3, 6, 12, 24 . . . hours after
transfection). It is important to time Ad.AAV infection exactly because viral DNA
replication must be taking place or finished before Rep production reaches peak levels.
In general, adenovirus DNA replication in 293 cells (infected with MOI 10) is maximal at
18 hours post-infection, followed by production of structural proteins, packaging of viral
genomes, and breakdown of cellular membrane structures (which is concluded ~36-48h
p.i.) (Shenk, T., 1996, In B. N. Fields, et al. (eds.), Fields Virology, vol. 2 Lippincott-Raven
Publisher, Philadelphia; van der Vliet, B., 1995, In W. Doerfler, et al. (eds.) vol. 2
p. 1-31, Springer-Verlag, Berlin). Viruses are collected 48h after infection and banded
by CsCl ultracentrifugation. Viral material from purified bands corresponding to
ΔAd.AAV is lysed, DNAse-treated (to liberate DNA-associated Rep) and subjected to
immunoprecipitation-Western Blot with Rep specific antibodies to detect co-packaged
Rep. Based on theoretical calculations assuming that two Rep molecules bind per Ad
genome, ~1-10 ng of Rep protein is expected from lysates of 10^10 particles, which is within
the range of detectability by Western Blot. Alternatively, co-packaged Rep may be
detected based on its functional activity to mediate rescue and site-specific integration of
the AAV-ITR-transgene cassette. To test whether functional Rep protein is co-packaged
into hybrid vector particles, CsCl purified ΔAd.AAV1 particles generated in 293 cells
co-expressing Rep after Ad.AAV1 infection (ΔAd.AAV1+Rep) can be used for transduction
studies. Three days after ΔAd.AAV1+Rep infection of the human cell line K562, cellular
DNA is analyzed for AAVS1-specific integration events by PCR and PFGE. If efficient
Rep-mediated site-specific integration of excised AAV cassettes is successful, then other
ΔAd.AAV+Rep hybrid vectors with β-Gal and SNori as transgenes can be produced.
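
The expected Rep yield quoted above can be checked with a short calculation. The sketch below is illustrative only: it assumes one genome per particle and two Rep molecules per genome, as stated in the text, and takes nominal molecular weights of 68 kDa and 78 kDa from the protein names (an assumption, not a measured value). The function name is hypothetical.

```python
AVOGADRO = 6.02214076e23  # molecules per mole


def rep_mass_ng(particles, rep_per_genome, mol_weight_da):
    """Mass in ng of Rep protein co-packaged in `particles` capsids.

    Assumes one vector genome per capsid, so the number of Rep molecules
    is particles * rep_per_genome.
    """
    molecules = particles * rep_per_genome
    grams = molecules / AVOGADRO * mol_weight_da
    return grams * 1e9  # g -> ng


# Nominal molecular weights (assumed from the names Rep 68 / Rep 78).
for name, mw in (("Rep 68", 68_000), ("Rep 78", 78_000)):
    print(f"{name}: {rep_mass_ng(1e10, 2, mw):.2f} ng per 1e10 particles")
```

Under these assumptions both proteins give roughly 2 to 3 ng per 10^10 particles, which falls inside the ~1-10 ng range stated in the text.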

Integration studies with Rep vectors in erythroid cells:

The hypotheses behind the rationale of a rep-expressing hybrid vector (ΔAd.AAV-rep)
are: (1) transient Rep co-expression from ΔAd.AAV-rep vectors can enhance site-specific
vector integration in human cells and (2) integration occurs via the AAV ITR(s)
without the rep gene, which is placed outside the AAV cassette, thus eliminating rep
expression upon vector integration. To test hypothesis 1, transduction frequencies of
ΔAd.AAV/rep versus ΔAd.AAV vectors can be compared based on the formation of
G418 resistant colonies, and site-specific integration events can be quantified at different
time points after infection of human and mouse cells by PFGE and PCR. To test hypothesis 2,
the structure of integrated vector in transduced cell populations and single clones can be
delineated by Southern analysis and by sequencing of vector/chromosomal DNA
junctions. These studies can be performed with ΔAd.AAV-rep, ΔAd.AAV, and
ΔAd.AAV+Rep (copackaged protein) in human K562 or HEL (for AAVS1 integration)
and mouse MEL cell lines.

WO 00/73478 PCT/US00/15442 

Cells infected with ΔAd.AAV-SNori, ΔAd.AAV-SNori+Rep or ΔAd.AAV-SNori-rep can
be subjected to G418 selection. The number of G418 resistant colonies is determined after
4 weeks of selection in relation to the number of initially infected cells. The selection
process for colonies that did not survive continued selection due to potential rep-mediated
cytotoxicity or episomal vector expression can be monitored. If rep expression from
ΔAd.AAV-SNori-rep does not affect cell viability and proliferation, then more G418
resistant colonies should appear with ΔAd.AAV-SNori-rep and ΔAd.AAV-SNori+Rep. The
structure of integrated vector can be determined by Southern Blot and sequencing of
integration junctions.


To uncover a potential selection bias against rep producing cells after transduction with
ΔAd.AAV/rep, site-specific and random vector integration events can be quantitated in
cellular DNA isolated from cell populations at different time points after infection (e.g.
0.5, 1, 3, 7, 14 days). To do this, the techniques based on PFGE-Southern can be utilized.
It is expected that the signal(s) for AAVS1-specific integration in ΔAd.AAV/rep infected
human cells increases during the first days after infection and then remains constant over
time.

In a separate study, the integration status of vector DNA (analyzed by PFGE or PCR) and
the number of integrated copies (analyzed by Southern Blot) can be correlated with the
expression level of β-galactosidase in single clones transduced with β-Gal hybrid vectors
(ΔAd.AAV-BG, ΔAd.AAV-BG+Rep, or ΔAd.AAV-BG-rep). Together with data
obtained in the studies described in the Specification, this allows assessment of whether
transcriptional silencing is associated with site-specific vector integration into the
AAVS1 site.

It is not clear a priori whether the specific Rep function for vector rescue,
concatemerization, and integration can efficiently occur in non-S-phase or non-dividing
cells. To test whether ΔAd.AAV*, ΔAd.AAV or ΔAd.AAV-rep/+Rep vectors can
integrate into non-dividing cells, transduction studies in cell cycle arrested cell cultures
can be performed as described earlier.

DISCUSSION

The establishment of stable cell lines expressing Rep 68/78 at detectable levels is not 
possible, which is probably due to rep mediated cytotoxicity. Therefore, it is not possible 
to perform long-term transduction studies (e.g. G418 selection or studies in single clones) 
in combination with ectopic rep expression. Moreover, due to the inhibitory effect of rep 
on adenovirus replication, it is currently not possible to generate adenoviral vectors 
expressing rep under the RSV or PGK promoter. 

Taken together, this indicates that co-expressed Rep may stimulate site-specific transgene 
integration. 

E. A Detailed Study Of Transduction/Integration Of Hybrid Vectors In Erythroid
Cell Lines:

In order to improve transduction and integration frequencies of the hybrid vectors in
erythroid cell lines, a detailed study comparing various hybrid vectors has to be carried
out as described below. The transduction studies are performed in K562 cells, which are
considered to be an adequate model to study gene transfer vehicles into erythroid cells
(Floch, V., et al., 1997, Blood Cells, Mol. and Diseases, 23, 69-87). The optimal vectors
should be able to integrate into the cellular genome with a high frequency, determined by
pulsed field gel electrophoresis (PFGE) and Southern blot as described in Example 4. In
addition, the results from the following studies will serve to evaluate whether a given
hybrid vector needs to be modified for site-specific integration in the host genome.

Sequencing of integration junctions: 

The ultimate proof for vector integration is the sequencing of junctions between SNori
vector DNA and chromosomal DNA. Furthermore, this clarifies the question whether the
AAV ITRs represent the substrate for integration. Specifically, DNA from clones with
known ΔAd.AAVSNori integration structure (analyzed by Southern Blot) is digested with
EcoRI, which does not cut within the SNori cassette. The resulting fragments are
circularized and transformed into a specific E. coli strain (according to the protocol
described by Rutledge and Russell (Rutledge, E. A. et al., 1997, Journal of Virology,
71:8429-36)). Kanamycin resistant bacterial clones should contain the integrated SNori
cassette. Flanking chromosomal DNA in rescued plasmids can be sequenced with
primers specific to the transgene.

To confirm vector integration in a small number of transduced cells, genomic DNA is
extracted and digested with EcoRI. EcoRI fragments are ligated to linkers containing a
specific primer binding site and are then digested with NotI, religated and propagated in
E. coli. Plasmid DNA from a representative number of bacterial clones is sequenced to
determine the vector/chromosomal DNA junctions.

Dose dependent toxicity:

In order to test whether the transduction frequency is dose-dependent and whether
ΔAd.AAV vectors, which are devoid of all adenoviral genes, can be used to infect cells at
higher doses with less cytotoxicity than first generation adenovirus, K562 cells are infected
with different MOIs (1-10^5) of ΔAd.AAVBG and the first generation vector Ad.AAVBG
(which contains the same β-Gal expression cassette). At day 4 post-infection, the total
number of cells, the percentage of viable cells (based on trypan blue exclusion) and the
percentage of X-Gal positive cells are counted. A fraction of infected cells is quantified
for β-Gal expression using the Galacto-Light kit. The level of transgene expression is
expected to be comparable between the two vectors. K562 cells are predicted to tolerate
higher doses of ΔAd.AAVBG better than Ad.AAVBG, which expresses viral genes.

Integration frequency with and without G418 selection:

In order to investigate the integration frequency of the different vectors and to confirm
that AAV ITRs present in double-stranded adenoviral DNA genomes can mediate vector
integration with a frequency comparable to rAAV vectors, integration studies are
performed based on the formation of G418 resistant colonies with ΔAd.AAVSNori,
AdSNori, Ad.SNoriITR, and rAAVSNori after infection with 2x10^5 and 2x10^6 genomes
per cell (Fig. 8). After infection, cells are plated in 96 well plates under limiting dilution
and selected with G418 to estimate the frequency of formation of G418 resistant colonies.
Another set of cells is plated without G418. A representative number of clones (w/ and
w/o G418 selection) are expanded to >10^6 cells (after 3-4 weeks of culture) and analyzed
for the presence of viral DNA by Southern Blot as well as PFGE analysis to discriminate
between episomal vector DNA and vector genomes stably associated with chromosomal
DNA. This allows us to estimate the integration frequency of the different vectors, to
assess the effect of G418 selection on integration, and to consider position effects on neo
expression in calculating the total integration frequency. Integrated vector copies with a
frequency of at least 1x10^-4 are predicted only for ΔAd.AAVSNori and rAAVSNori. The
total number of colonies may be lower in both the first generation vectors,
Ad.AAVSNoriITR and Ad.SNori, due to the toxic effects of expressed adenoviral
proteins; however, a higher integration frequency is predicted for the vector containing
the AAV ITRs (Ad.AAVSNoriITR).
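
One common way to turn limiting-dilution plate counts into a colony-forming frequency is a single-hit Poisson model. The text does not state which estimator is used, so the sketch below is an assumption offered for illustration; the function name and the plate counts are hypothetical.

```python
import math


def colony_frequency(negative_wells, total_wells, cells_per_well):
    """Estimate the fraction of plated cells that form a G418 resistant colony.

    Single-hit Poisson model (an assumption, not taken from the text):
    P(well without colony) = exp(-f * cells_per_well), hence
    f = -ln(fraction of negative wells) / cells_per_well.
    """
    if not 0 < negative_wells <= total_wells:
        raise ValueError("need at least one negative well, and negative <= total")
    fraction_negative = negative_wells / total_wells
    return -math.log(fraction_negative) / cells_per_well


# Hypothetical plate: 96 wells, 1e4 infected cells per well, 35 wells without colonies.
frequency = colony_frequency(35, 96, 1e4)
print(f"estimated frequency: {frequency:.2e}")
```

With these made-up counts the estimate is on the order of 1x10^-4, the threshold discussed above. The estimator requires at least one well without colonies, which is why plating at several dilutions is useful.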

Kinetics of integration: 


Compared to rAAV, the double-stranded nature of entering ΔAd.AAV genomes provides
more protection against degradation. Furthermore, the synthesis of transcriptionally
active double-stranded intermediates from single-stranded genomes, which is considered
a limiting step in rAAV transduction, is not required in ΔAd.AAV transduction. Thus,
the lag phase between infection and expression seen with rAAV vectors, which is
causally linked to double-strand synthesis/integration, may be shorter or absent in
infections with ΔAd.AAV vectors. Furthermore, it was demonstrated earlier that a 9kb
mini-adenoviral genome packaged into adenoviral particles is only short lived and
completely degraded by day 3 post-infusion. In contrast, transduction with the 5.5kb
ΔAd.AAV1 (Figure 1) genome allows for long-term expression, suggesting that either
AAV ITRs can stabilize the viral genome as an episome until it is integrated or
integration occurs shortly after infection.

The status of vector DNA can be examined in K562 cells at different time points after
infection with ΔAd.AAVSNori, AdSNori, Ad.AAVSNoriITR, or rAAVSNori (MOI
2x10^5). Infected cells are harvested at 1 hour, 5 hours, 1 day, 3, 7, and 14 days after
infection and chromosomal DNA is analyzed by PFGE followed by hybridization with a
transgene specific probe. This technique allows us to distinguish between episomal
vector DNA, which appears as a distinct 5.0kb band, and integrated DNA. Furthermore,
extrachromosomal high-molecular weight vector concatemers can be detected. In the
case of random integration, after digestion of chromosomal DNA with I-CeuI or PI-SceI,
vector-specific signals in the range of 1-2 Mb should be seen. The intensity of episomal
and integrated vector signal is quantified for each time point using phosphoimager
analysis. This gives information about the kinetics of hybrid vector integration in a
population of infected K562 cells and the intracellular stability of hybrid vector genomes.
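
The per-time-point quantification described above can be summarized as the fraction of vector signal that is integrated. The sketch below is a minimal illustration only; the function name and the intensity values are hypothetical placeholders, not data from the study, and the text does not prescribe this particular summary statistic.

```python
def integrated_fraction(episomal_signal, integrated_signal):
    """Fraction of total vector-specific signal found in the integrated band(s)."""
    total = episomal_signal + integrated_signal
    if total <= 0:
        raise ValueError("no vector-specific signal detected")
    return integrated_signal / total


# Harvest times from the text, in days (1 hour, 5 hours, 1, 3, 7, 14 days).
time_points_days = (1 / 24, 5 / 24, 1, 3, 7, 14)

# Hypothetical phosphoimager intensities (arbitrary units): (episomal, integrated).
signals = ((980, 20), (950, 50), (800, 200), (500, 500), (300, 700), (250, 750))

for day, (episomal, integrated) in zip(time_points_days, signals):
    print(f"day {day:5.2f}: integrated fraction = "
          f"{integrated_fraction(episomal, integrated):.2f}")
```

Plotting this fraction against time would show the lag (if any) between infection and integration, and the total signal per time point gives a rough readout of intracellular genome stability.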

Structure of integrated vector DNA and integration junctions with chromosomal DNA: 

ΔAd.AAV1 integrates as concatemer(s) randomly into host DNA as shown previously.
How many vector copies are present in one concatemer and whether the extent and the
kinetics of tandem-formation are dose dependent still remain unclear. Another
unanswered question is how ΔAd.AAV integrates: whether one or both ITRs are
involved, whether the integrated ITRs are still intact, and whether adenoviral sequences
integrate as well. These issues are important for the strategy to include rep genes into the
hybrid vector genome. Moreover, if intact AAV ITRs are present within integrated
vector copies, helper virus (adenovirus or HSV) infection in vivo may mobilize the
integrated AAV-ITR vector cassette and affect stability of transgene expression.

To answer these questions, K562 cells can be infected with ΔAd.AAVSNori, AdSNori,
Ad.AAVSNoriITR, and rAAVSNori at MOIs of 2x10^5, 2x10^6, or 2x10^7 genomes per cell.
Infected cells are plated in 96 well plates in the presence or the absence of G418. The
latter is included because G418 may cause amplification of integrated vector DNA
(Rutledge, E. A. et al., 1997, Journal of Virology, 71:8429-36). Genomic DNA from
isolated clones can be analyzed by regular Southern Blot as described in the Examples
Section to confirm the presence of vector concatemers and calculate the number of
integrated vector copies. More informative is the sequencing of integrated vector copies
and their junctions with chromosomal DNA. The structure of integration junctions can
be used to delineate the role of AAV ITRs in vector integration and the extent of
insertional mutagenesis after transduction. These data provide information about the
potential risks of hybrid vectors used in clinical trials.


Transduction of cell cycle arrested cells:

The ultimate targets for the hybrid vectors described in the Specification are quiescent
hematopoietic stem cells. We hypothesize that the double-stranded nature of ΔAd.AAV
genomes and specific nuclear import mechanisms may allow for the transduction of
non-dividing cells. This is in part supported by the transduction studies with ΔAd.AAV1 in
quiescent hepatocytes in vivo. To confirm these data, primary fibroblasts can be forced to
enter the G0 phase by serum/growth factor starvation before infection with the hybrid
vectors according to a protocol described by Russell (Russell, D. et al., 1995, PNAS,
92:5719-23). Cells are maintained for three days after infection under serum/growth
factor deprivation. At this time point, genomic DNA is isolated and analyzed for
integration events by PFGE in comparison with growing cells. Another series of
integration studies can be carried out on K562 cells arrested in the G1/S phase of the cell
cycle with aphidicolin (added 1 day before and maintained several days after infection
with hybrid vectors, depending on the integration kinetics studies described earlier). To
investigate whether DNA damaging agents increase the transduction frequency of hybrid
vectors, cell-cycle-arrested K562 cells or primary fibroblasts can be treated with
cisplatinum or 3H-thymidine prior to virus infection according to a protocol described by
Alexander and Russell (Alexander, I. E. et al., 1994, J. Virol, 68:8282-87; Russell, D. et
al., 1995, PNAS, 92:5719-23). Furthermore, the effect of chromosomal DNA
decondensation on the transduction efficiency of hybrid vectors can be studied in arrested
[...]

[...] genome is [...] digested [...] the XhoI s[...] cassette are deleted by site-directed [...]
only upon [...] large part of Ad DNA when [...] is completed [...]
(Balague, C. et al., 2000, Blood, 95:820-8).



EXAMPLE II

[...] ~50 [...] serotypes, it is thought that different adenovirus serotypes bind to different
cellular receptor proteins or use different entry [...] ([...] In B. N. Fields, et al. (eds.),
Fields Virology, vol. 2 [...]; Mathias, P. et al., 1994, Journal of Virology, 68:6811-14;
Defer, M., et al., [...] 64, 3661-3673). Although most adenoviruses contain RGD motifs in the
penton base proteins, there are a number of serotypes (e.g. Ad 40, 41) without this
conserved sequence. These types may use integrin αv-independent pathways for virus
[...] (Davison, A. J., et al., 1993, [...] 234, 1308-1316; Mathias, P. et [...]

[...] infect stem cell subpopulations present in human bone marrow, studies with a series of
different human Ad serotypes and animal viruses can be performed (see Table II). As a
means to verify efficient infection with Ad serotypes, viral DNA is tagged before
infection and the presence of viral genomes in the nuclei of transduced cells is
investigated. Furthermore, whether viral DNA is replicated in transduced cells can be
analyzed as indirect proof for early viral gene expression. A direct detection of expressed
viral proteins is impossible due to the unavailability of antibodies against all the
serotypes included in this study. Simultaneously with the infection assay, transduced
human bone marrow cells can be analyzed for morphological and immunohistochemical
features characteristic of HSC or progenitor subpopulations. For retargeting, serotypes
which are able to infect CD34+ subsets of bone marrow cells at the lowest MOI are
selected. As the next step, the fiber gene is PCR-cloned from serotypes with potential
HSC/CD34+ tropism and inserted into standard shuttle plasmids for Ad5 vector
generation replacing the Ad5 fiber gene using an E. coli recombination system (Figure

METHODS 
Cells and viruses:

HeLa (human cervix carcinoma, ATCC CCL-2.2), CHO (Chinese hamster ovary, ATCC
CCL-61), K562 (human hematopoietic, ATCC 45506), HEp-2 (human larynx carcinoma,
ATCC CCL-23), and 293 (human embryonic kidney, Microbix, Toronto, Canada) cells were
maintained in DMEM, 10% FCS, 2 mM glutamine, and Pen/Strep. Culture media for
CHO cells was supplemented with 200 µM asparagine and 200 µM proline. Human
CD34+-enriched bone marrow cells were purified from peripheral blood after
mobilization using MiniMACS VS+ separation columns (Miltenyi Biotec, Auburn, CA),
according to the manufacturer's instructions. Aliquots were stored in liquid nitrogen.
Sixteen hours before the experiment, cells were recovered from the frozen stock and
incubated overnight in IMDM media, supplemented with 20% FCS, 10^-4 M
β-mercaptoethanol, 100 µg/ml DNaseI, 2 mM glutamine, 10 U/ml IL-3, and 50 ng/ml stem
cell factor (SCF) or 2 ng/ml thrombopoietin (Tpo). The purity of CD34+ preparations
was verified by flow cytometry and was consistently greater than 90%.

Flow cytometry: 

Adherent cells (CHO, HeLa) grown in non-tissue culture treated 10 cm dishes (Falcon,
Franklin Lakes, NJ) were detached by treatment with 1 mM EDTA and washed three
times with wash buffer (WB), consisting of PBS supplemented with 1% FCS. Cells
grown in suspension (K562, CD34+) were washed three times with WB. After washing,
cells were resuspended in WB at 2 x 10^6 cells/ml. 2 x 10^5 cells were incubated in WB for
1 h at 37°C with monoclonal antibodies specific for αv-integrins [L230, ATCC HB-8448
(Rodriguez, E., Everitt, E. 1999. Arch. Virol. 144:787-795) (1/30 final dilution)],
CAR [RmcB (Bergelson, J. M., et al. 1997. Science. 275:1320-1323; Hsu, K.-H. L., et al.
1988. J. Virology. 62:1647-1652) (1/400 final dilution)], or BrdU [(Amersham, Arlington
Heights, IL) (1/100 final dilution)]. Subsequently, cells were washed with WB and
incubated with fluorescein isothiocyanate (FITC)-labeled horse anti-mouse IgG
antibodies [(Vector Labs., Burlingame, CA) (1/100 final dilution)] or phycoerythrin
(PE)-labeled goat anti-mouse IgG antibodies [(Calbiochem, La Jolla, CA) (1/100 final
dilution)] for 30 min at 4°C. After incubation with secondary antibodies, cells were
washed two times with WB, and 10^4 cells per sample were analyzed in duplicate by flow
cytometry.

For the analysis of CD34 and c-kit expression on transduced CD34+ cells and for
fluorescence-activated cell sorting (FACS), purified human CD34+ cells were incubated
with phycoerythrin (PE)-conjugated anti-CD34 monoclonal antibodies (Becton-Dickinson
Immunocytochemistry Systems, San Jose, CA) or with PE-labeled anti-CD117 (c-kit)
monoclonal antibodies (MAb 95C3, Immunotech, Beckman Coulter, Marseille, France)
according to the manufacturer's protocol followed by flow cytometry analysis. All
analyses and sortings were performed on a FACStar Plus flow cytometer (Becton
Dickinson, Franklin Lakes, NJ) equipped with 488 nm argon and 633 nm HeNe lasers.
For analysis of c-kit expression and FACS purification of CD34+/c-kit+ cells, SCF was
not added to the media during culturing of CD34+ cells.

RESULTS

CAR/αv-integrin expression on test cells:

It is generally accepted that CD34+ cells possess bone marrow repopulating activity.
Therefore, we used human CD34+ cells as the target for our studies towards identifying
Ad serotypes with HSC tropism and constructing new viral vectors. Studies were
performed on mobilized, CD34-positive, peripheral blood cells from one donor under
conditions which are known to retain CD34+ cells in a quiescent stage (Leitner, A., et al.
1996. Br. J. Haematol. 92:255-262; Roberts, A. W., Metcalf, D. 1995. Blood 86:1600-1605).
More than 90% of purified cells were CD34 positive by flow cytometry.
Furthermore, we included into our Ad tropism studies the cell line K562, which is
considered to be an adequate model for studying gene transfer into human hematopoietic
cells (McGuckin, et al. 1996. British Journal of Haematology. 95:457-460). HeLa cells,
which are readily infectible by Ad5, and CHO cells, which are refractory to Ad5 infection
(Antoniou, M. et al., 1998, Nucleic Acid Res., 26:721-9), were used as positive and
negative control cell lines, respectively.

For Ad5, binding both to the primary receptor and to αvβ3 and αvβ5 integrins is
important for high-efficiency infection of target cells. The expression of CAR and αv
integrins on test cells was analyzed by flow cytometry using monoclonal antibodies
against CAR (RmcB (Bergelson, J. M., et al. 1997. Science. 275:1320-1323; Hsu, K.-H.
L., et al. 1988. J. Virology. 62:1647-1652)) and αv integrins (L230 (Roelvink, P. W., et
al. 1996. J. Virology. 70:7614-7621)) (Figure 10). As expected, nearly all HeLa cells
expressed high levels of CAR and αv-integrins, whereas CHO cells lacked significant
CAR and αv-integrin expression. Fifteen percent and 77% of K562 cells expressed CAR
and αv-integrins, respectively. Only ~6% of the CD34+ cells used in our studies expressed
CAR and 17% were positive for αv-integrins. Notably, the preparation of CD34+ cells
represents a mixture of different cell types. The absent or low expression of primary and
secondary Ad5 receptors on non-cycling human CD34+ cells is in agreement with
previous reports (Huang, S., et al. 1996. J. Virology. 70:4502-4508; Neering, S. J., et al.
1996. Blood. 88:1147-1155; Tomko, R. P., et al. 1997. Proc. Natl. Acad. Sci. USA.
94:3352-3356).

Infection assay using wild-type Ad5 and K562 cells: 

The presence of viral DNA in the nucleus of infected cells is an indirect means to
demonstrate efficient virus binding, internalization, and nuclear import. Nuclear
localization of the viral genome is a prerequisite for transgene transcription and
integration. Two techniques are utilized to tag viral DNA for in situ analysis. To
optimize the infection assay, wild-type Ad5 virus and K562 cells which are permissive
for Ad5 infection can be used. The first protocol (Challberg, S.S. and Ketner, S. 1981,
Virology 114, 196-209) is based on 32P-labeling of viral DNA. During amplification of
wild-type Ad5 in A549 cells, 32P-phosphate (40 µCi/ml) is added to phosphate-free
medium. After development of CPE, 32P-tagged virus is harvested, banded in CsCl
gradients, and titered on HeLa cells according to standard protocols. To simulate the
conditions for infection of human bone marrow cells, K562 cells are incubated in
suspension with 32P-Ad5 at an MOI of 1, 10, or 100 for 2, 4, 6, or 8 hours under agitation
at 37°C. This covers the time period necessary for adsorption, internalization, and
nuclear import. After washing, cells are fixed and either transferred to microscopy slides
using cytospin or embedded in paraffin and sectioned (according to protocols from
VECTOR labs, Burlingame, CA). The latter has the potential advantage that multiple
consecutive sections (5 µm) of the same cell can be analyzed by different methods (e.g.
for 32P-tagged viral DNA, for specific histological staining, for immunofluorescence),
which allows for correlating infection with a particular cell type present in the bone
marrow. Cells are incubated in a Kodak NTB-2 photo emulsion for autoradiography.
The exposure time can be optimized to minimize background or non-nuclear localized
signals. A dose- and time-dependent appearance of nuclear silver grains is expected under
the optimized conditions. Since 32P-phosphate can label viral proteins as well, a
cytoplasmic background signal might appear. To facilitate detection,
immunofluorescence with HSC-specific antibodies on sections can be performed. As an
alternative method, a BrdU-labeling technique for viral DNA can be used (Lieber, A., et
al. 1999. J Virol 73:9314-24; Lieber, A. et al., 1996, Journal of Virology, 70:8944-60).
In this case, different amounts of BrdU are added to the A549 culture medium during
wtAd5 virus propagation. BrdU-labeled viral DNA can be detected with monoclonal
antibodies specific to BrdU. The signal can be enhanced using layers of species-specific
polyclonal antibodies in combination with biotin/avidin and a fluorescent marker. BrdU-
tagged viral DNA can be detected on cytospins of bone marrow cells together with cell
surface markers by double or triple immunofluorescence.

DISCUSSION

The interaction of selected Ad serotypes with CD34+ cells was tested. As a result of this
screening we constructed a first-generation, Ad5-based vector whose fiber was
substituted with the fiber derived from Ad35. We demonstrated that this capsid
modification allowed for efficient viral transduction of potential HSCs by the
corresponding chimeric Ad vectors.

All tropism and transduction studies were performed with non-cycling CD34+ cells,
which are thought to include HSCs. The quiescent stage of CD34+ cells purified from
mobilized blood is important because induction of cell proliferation is associated with a
loss of the ability to reconstitute hematopoiesis and with changes in the spectrum of
cellular receptors (Becker, P. S., et al. 1999. Exp. Hematol. 27:533-541). It is known that
treatment of hematopoietic cells with cytokines or growth factors changes the expression
of specific integrins including αv-integrins, which would ultimately alter the
susceptibility of cells to Ad infection or may affect viability of infected cells (Gonzalez,
R., et al. 1999. Gene Therapy. 6:314-320; Huang, S., et al. 1995. J. Virology.
69:2257-2263). Another fact that complicates the interpretation of transduction studies
is the extraordinary heterogeneity of CD34+ cells with regard to morphology and function.

B. Screening different adenoviruses to establish tropism to HSC. 

The ATCC provides more than 70 different human or animal adenoviruses (see Appendix
I). A collection of 15 human serotypes and 6 animal adenoviruses (see Table II) is
selected based on the following criteria: (i) availability of the complete genome sequence
or fiber sequence from the NIH GenBank database, (ii) CAR receptor usage absent or
unknown, (iii) different subgroups, and (iv) moderate or low tumorigenicity (Shenk, T.,
1996, In B. N. Fields, et al. (eds.), Fields Virology, vol. 2, Lippincott-Raven Publishers,
Philadelphia). However, any serotype shown in the Appendix hereto can be used for the
invention described. Animal viruses are included in the infectivity assay because this may
provide a means to circumvent the pre-existing humoral immunity against human Ad5
fiber, which represents a critical obstacle for clinical trials with Ad vectors.

METHODS 

Viruses:



The following human adenovirus serotypes were purchased from the ATCC: 3 (VR-3), 4
(VR1081), 5 (VR-5), 9 (VR1086), 35 (VR-716) and 41 (VR-930). Adenovirus No. VR-716
was purchased from ATCC labeled as serotype 34; however, it was found to be
serotype 35 upon sequencing of the fiber region. For amplification, HeLa, 293, or HEp-2
cells were infected with the corresponding Ads under conditions that prevented
cross-contamination. Virus was banded in CsCl gradients, dialyzed and stored in aliquots
as described elsewhere (Lieber, A., C.-Y. et al. 1996. Journal of Virology. 70:8944-8960).
Plaque titering was performed as follows: Confluent 293 cells plated in 6-well plates
were incubated for 24 hours with virus in a total volume of 1 ml. Two weeks after
infection, plaques were counted on cultures overlaid with 1% agarose/MEM/10% FCS.

EM studies:

CsCl-banded Ad stocks were thawed and diluted with 0.5% glutaraldehyde. Grids were
prepared as described earlier (Mittereder, N., et al. 1996. J. Virology. 70:7498-7509).
After staining with 2% methylamine tungstate (Nanoprobes, Stony Brook, NY), the
carbon-coated grids were evaluated and photomicrographed with a Philips 410 electron
microscope, operated at 80 kV (final magnification 85,000x). For each particular Ad
serotype, the number of morphologically defective viral particles per 100 was counted in
five random fields.

RESULTS

Electron microscopy:

Little is known about the stability of particles from serotypes other than Ad5. Since the
intactness of viral particles was crucial for comparative interaction studies, virions from
the serotypes specified above were analyzed by electron microscopy (EM). EM studies of
negative contrast stained Ad suspensions demonstrated that the percentage of defective
particles (loss of icosahedral shape or luminal staining) did not exceed 5%, indicating that
serotype preparations had comparable qualities. Representative EM photographs are
shown for Ads 5, 9, and 35 (Figure 11).



Serotype screening: 

It is thought that different Ad serotypes bind to different cellular receptor proteins and
use different entry mechanisms (Defer, C., et al. 1990. J. Virology. 64:3661-3673;
Mathias, P., et al. 1994. Journal of Virology. 68:6811-6814). A set of human
adenoviruses was obtained from the ATCC to be tested for tropism to CD34+ cells.
These included serotypes 3, 4, 5, 9, 35, and 41 representing different subgroups (Table 1).
We believed that these serotypes would use different cellular attachment and
internalization strategies due to differing lengths of fiber shafts (Chroboczek, J., et al.
1995. Adenovirus fiber, p. 163-200. In W. Doerfler and P. Böhm (ed.), The molecular
repertoire of adenoviruses, vol. 1. Springer Verlag, Berlin; Roelvink, P. W., et al. 1998.
J. Virology. 72:7909-7915), the presence or absence of RGD motifs within the penton
base, and differing tissue tropism. The relatively little characterized Ad35 was selected
because it was found in immunocompromised hosts, particularly in bone marrow
recipients (Flomenberg, P., et al. 1994. Journal of Infectious Diseases. 169:775-781;
Flomenberg, P. R., et al. 1987. Journal of Infectious Diseases. 155:1127-1134; Shields,
A. F., et al. 1985. New England Journal of Medicine. 312:529-533). The latter
observations prompted us to believe that bone marrow cells are among the natural
reservoirs for Ad35.

TABLE II

Human and animal adenoviruses with potential interest for the invention

Adenovirus        Serotype
Human/Group B     2, 7, 11, 16, 21, 34, 35
Human/Group D     8, 15, 17, 19, 28, 37
Human/Group F     40, 41
Avian             CELO, EDS
Bovine            3
Canine            1, 2
Ovine             5
Swine             4
Mouse             1

The underlined serotypes use CAR independent pathways for cell entry.



For amplification, HeLa or A549 cells can be infected with the corresponding adenovirus
stocks such that at a given time only one virus type is handled in a separate laminar
flow hood and cultured in HEPA-filtered bottles, preferentially in separate CO2 incubators,
to avoid cross-contamination. During propagation, viral DNA is tagged using one of the
techniques described earlier. Viral DNA can be isolated from purified particles. The
XhoI restriction pattern is analyzed for methylated and unmethylated viral DNA by
Southern blot using the full genome of the corresponding virus type as a radioactive
probe.

DISCUSSION 


Although it was reported earlier by slot-blot assay that fiber knobs derived from Ad2,
Ad9, Ad4, and Ad41L can bind to CAR (Roelvink, P. W., et al. 1998. J. Virology.
72:7909-7915), it is not clear whether this binding occurs with an affinity that is
physiologically relevant and whether this would confer cell entry. Furthermore, as shown
for the Ad5 interaction between the penton and integrins, a secondary receptor is required
to induce virus internalization. We demonstrated that different serotypes interacted
differently with the K562 or CD34+ target cells. Ad5, Ad4, and Ad41 were not able to
efficiently attach to and be internalized by K562 and CD34+ cells. Although Ad4 belongs
to a separate subgroup (E), it is thought that Ad4 represents a natural hybrid between
subgroup B and C viruses with a fiber related to Ad5 (Gruber, W. C., et al. 1993.
Virology. 196:603-611). Therefore, it was not surprising that Ad4 has binding properties
similar to Ad5. The subgroup F serotype Ad41 has been shown to contain two distinct
fibers, a long-shafted and a short-shafted fiber, allowing for different cell entry pathways
(Tiemessen, C. T., Kidd, A.H. 1995. J. Gen. Virol. 76:481-497). The Ad41 penton base
does not contain RGD motifs, suggesting that this virus may use αv-integrin-independent
pathways for cell entry. However, these features did not improve interaction with CD34+
cells. Ad9, Ad3, and Ad35 did interact with CD34+ cells more efficiently than Ad5. Out
of all the serotypes tested, Ad35 demonstrated the most efficient attachment and
internalization with K562 and CD34+ cells. Although the short-shafted Ad9 can bind to
CAR, it preferentially uses αv-integrins for cell entry (Roelvink, P. W., et al. 1996.
J. Virology. 70:7614-7621). Therefore, the low level of αv-integrin expression on certain
subsets of CD34+ cells may account for the observed susceptibility to Ad9.

C. Attachment and Internalization of the Ad serotypes to K562 and CD34+ cells.

METHODS

Labeling of Ads with [3H]-methyl thymidine:

Serotypes were labeled with [3H]-methyl thymidine as described in detail elsewhere
(Roelvink, P. W., et al. 1996. J. Virology. 70:7614-7621). Briefly, 5 x 10^7 HeLa or 293
cells were grown in 175 sq. cm flasks with 15 ml DMEM/10% FCS and infected with
wild type adenovirus at a MOI of 50 or higher. Twelve hours post-infection, 1 mCi of
[3H]-methyl thymidine (Amersham, Arlington Heights, IL) was added to the media and
cells were further incubated at 37°C until complete CPE was observed. Then, cells were
harvested, pelleted, washed once with cold PBS, and resuspended in 5 ml PBS. Virus was
released from the cells by four freeze-thaw cycles. Cell debris was removed by
centrifugation and viral material was subjected to ultracentrifugation in CsCl gradients
and subsequent dialysis as previously described (Lieber, A., C.-Y. et al. 1996. Journal of
Virology. 70:8944-8960). Virus purification and dialysis removed unincorporated
radioactivity. Wild type Ad particle concentrations were determined
spectrophotometrically by measuring the OD260, utilizing the extinction coefficient for
wild-type Ad5, ε260 = 9.09 x 10^-13 OD ml cm virion^-1 (Maizel, J. V., et al. 1968. Virology.
36:115-125). The virion specific radioactivity was measured by a liquid scintillation
counter and was always in the range of 1 x 10^-5 to 1 x 10^-4 cpm per virion. For selected
variants, the fiber gene was PCR amplified and sequenced to ensure identity and the
absence of cross-contamination.
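
The conversion from an OD260 reading to a particle concentration described above can be sketched as follows. This is a minimal illustration, not the procedure used by the inventors: only the extinction coefficient is taken from the text, while the function name, the dilution factor, and the 1 cm path length are assumptions introduced for the example.

```python
# Extinction coefficient for wild-type Ad5 quoted in the text
# (Maizel et al. 1968): 9.09e-13 OD * ml * cm per virion.
EPSILON_260 = 9.09e-13


def particles_per_ml(od260, dilution_factor=1.0, path_length_cm=1.0):
    """Estimate virions per ml from an OD260 reading.

    dilution_factor and path_length_cm are illustrative assumptions;
    they are not specified in the text.
    """
    if od260 < 0:
        raise ValueError("OD260 must be non-negative")
    return od260 * dilution_factor / (EPSILON_260 * path_length_cm)


if __name__ == "__main__":
    # One OD260 unit of undiluted stock is roughly 1.1e12 virions per ml.
    print(f"{particles_per_ml(1.0):.2e} virions/ml")
```

With this coefficient, one OD260 unit corresponds to roughly 1.1 x 10^12 particles per ml.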

Viral DNA tagged with methylase and test for replication by genomic Southern blots:

To ultimately confirm transduction, a protocol to detect adenoviral replication in infected
cells can be established. Viral DNA synthesis can only occur after de novo expression of
adenoviral early genes. A site-specific methylation strategy is utilized to monitor viral
DNA replication within infected cells (Nelson, J. et al., 1997, Journal of Virology,
71:8902-07). Methylation-marked adenovirus can be produced by the addition of a
methyl group onto the N6 position of the adenine base of XhoI sites, CTCGAG, by
propagation of the virus in HeLa or A549 cells expressing the XhoI isoschizomer PaeR7
methyltransferase (PMT) (Kwoh, T.J., et al., 1986, Proc. Natl. Acad. Sci. USA 83,
7713-7717). It is known that methylation does not affect vector production but does
prevent cleavage by XhoI. Loss of methylation through viral replication restores XhoI
cleavage and can be detected by Southern blots of genomic DNA from infected cells in
comparison to native, non-methylated, viral genomes.
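
The logic of the methylation readout can be illustrated with a small in silico digest. The sketch below assumes a linear genome and models methylated DNA as fully protected from XhoI; the function name and the toy sequence are hypothetical and do not represent any real adenoviral genome.

```python
import re

XHOI_SITE = "CTCGAG"  # XhoI recognition sequence given in the text
CUT_OFFSET = 1        # XhoI cleaves after the first C (C^TCGAG)


def xhoi_fragments(genome, methylated):
    """Return fragment lengths of a linear genome after XhoI digestion.

    Methylated (input) genomes are modeled as fully protected, so they
    stay in one piece; replicated genomes have lost the mark and are cut.
    """
    if methylated:
        return [len(genome)]
    cuts = [m.start() + CUT_OFFSET for m in re.finditer(XHOI_SITE, genome)]
    edges = [0] + cuts + [len(genome)]
    return [end - start for start, end in zip(edges, edges[1:])]


if __name__ == "__main__":
    # Toy sequence with two XhoI sites, for illustration only.
    toy = "AAAA" + XHOI_SITE + "TTTTTT" + XHOI_SITE + "GG"
    print("input (methylated):", xhoi_fragments(toy, methylated=True))
    print("replicated:", xhoi_fragments(toy, methylated=False))
```

On a Southern blot, the shift from the single protected band to the smaller fragments is what indicates that replication has taken place.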

Attachment and internalization assays: 

These studies were performed based on a protocol published elsewhere (Wickham, T. J.,
et al. 1993. Cell. 73:309-319). In preliminary experiments, we found that labeled Ad5
virions reached equilibrium in attachment to HeLa cells after 45 min at 4°C with an MOI
of 400 pfu per cell. For attachment studies, 3.5 x 10^5 cells were incubated for one hour on
ice with equal amounts of [3H]-labeled adenovirus OD particles equivalent to an MOI of
400 pfu/cell for Ad5 in 100 µl of ice-cold adhesion buffer (Dulbecco's modified Eagle's
medium supplemented with 2 mM MgCl2, 1% BSA, and 20 mM HEPES). Next, the cells
were pelleted by centrifugation for 4 min at 1000 x g and washed two times with 0.5 ml
ice-cold PBS. After the last wash, the cells were pelleted at 1500 x g, the supernatant was
removed, and the cell-associated radioactivity was determined by a scintillation counter.
The number of viral particles bound per cell was calculated using the virion specific
radioactivity and the number of cells. To determine the fraction of internalized [3H]-
labeled adenoviral particles, cells were incubated on ice for one hour with the
corresponding virus, washed with PBS as described above, resuspended in 100 µl
adhesion buffer, and then incubated at 37°C for 30 min. Following this incubation, cells
were diluted 3-fold with cold 0.05% trypsin-0.5 mM EDTA solution and incubated at
37°C for an additional 5-10 min. This treatment removed 99% of attached radioactivity.
Finally, the cells were pelleted at 1500 x g for 5 min, the supernatant was removed, and
the protease-resistant counts per minute were measured. This protocol minimizes the
possibility that the internalization data were affected by receptor recycling (Rodriguez,
E., Everitt, E. 1999. Arch. Virol. 144:787-795). Nonspecific binding of Ad particles to
cells on ice was determined in the presence of 100-fold excess of unlabeled virus. This
value routinely represented less than 0.1% of viral load.
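
The calculation of particles bound per cell from scintillation counts can be sketched as below. This is an illustration under stated assumptions: the function names are hypothetical, the optional subtraction of nonspecific counts is one possible way to use the competition control, and the example counts are invented to show the arithmetic rather than taken from the experiments.

```python
def particles_per_cell(cell_cpm, cpm_per_virion, n_cells, nonspecific_cpm=0.0):
    """Convert cell-associated radioactivity (cpm) into virions per cell."""
    if cpm_per_virion <= 0 or n_cells <= 0:
        raise ValueError("cpm_per_virion and n_cells must be positive")
    specific_cpm = max(cell_cpm - nonspecific_cpm, 0.0)
    return specific_cpm / cpm_per_virion / n_cells


def internalized_fraction(attached_per_cell, internalized_per_cell):
    """Fraction of attached particles that became protease resistant."""
    if attached_per_cell <= 0:
        raise ValueError("attached_per_cell must be positive")
    return internalized_per_cell / attached_per_cell


if __name__ == "__main__":
    # Hypothetical reading: 1120 cpm on 3.5e5 cells, 1e-5 cpm per virion.
    attached = particles_per_cell(1120, 1e-5, 3.5e5)
    print(f"{attached:.0f} particles attached per cell")
    print(f"{internalized_fraction(attached, 240):.2f} internalized fraction")
```

The same function applies to the protease-resistant counts, which gives the number of internalized particles per cell.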

RESULTS 

Attachment of Ad particles to target cells and internalization:

The selected serotypes were metabolically labeled with [3H]-thymidine, which is
incorporated into viral DNA during replication. Adsorption and internalization can be
experimentally dissociated by taking advantage of the observation that at low temperature
(0-4°C) only virus-cell attachment occurs, whereas internalization requires incubation at
higher temperatures. The number of particles adsorbed or internalized per cell was
calculated using the virion-specific radioactivity and used to quantify interaction of Ads
3, 4, 5, 9, 35, and 41 with CD34+, K562, HeLa and CHO cells (Figure 12). The serotypes
varied significantly in their ability to attach to and to be internalized by the different cell
lines. For Ad5, the degree of attachment to the cell lines tested correlated with the level
of CAR expression. In CHO cells, which were previously shown to be refractory to Ad5
infection, the level of attachment and internalization was about 50-70 viral particles per
cell. This level was hereafter taken as negative in terms of susceptibility of a given
cell type to Ad5. Interaction of the other serotypes with CHO cells was not significantly
higher, indicating that the corresponding receptors were absent on CHO cells. All
serotypes tested interacted with HeLa cells, with Ad3 and Ad35 being the most efficient
variants. The presence of distinct Ad3 and Ad5 receptors on HeLa cells was demonstrated
previously (Stevenson, S. C., et al. 1995. J. Virology. 69:2850-2857). Ads 4, 5, and 41
did not bind to K562 cells. In contrast, Ad9 as well as the members of subgroup B, Ad3
and Ad35, efficiently interacted with K562 cells, with Ad35 having the highest number of
adsorbed and internalized particles. Compared to Ad5, about 25 times more Ad35
particles were attached, and three-fourths of these were internalized by K562 cells. Viral
interactions with CD34+ cells were generally weaker. Among the serotypes tested, only
Ad9 and Ad35 were significantly internalized by non-cycling CD34+ cells.
Internalization of Ad9 and Ad35 was, respectively, four and eight times more efficient
than for Ad5 particles. The number of Ad35 virions internalized by CD34+ cells was
almost half of that seen for Ad5 in HeLa cells, which can be readily infected with
Ad5-based vectors.

Attachment and internalization of adenovirus serotypes 3, 5, 9, 35 and 41 into HeLa, 293,
and CHO cells:

HeLa and 293 cells, which express high levels of primary and secondary receptors for
human adenoviruses, are used as a positive control for virus attachment and
internalization. As a negative control, CHO cells are used. CHO cells do not express the
primary adenoviral receptor at a detectable level, and are therefore refractory to
adenoviral infection. For attachment studies, these adherent cell lines are detached from
10 cm dishes with PBS-EDTA solution (without Ca2+ and Mg2+), washed three times with
ice-cold PBS, resuspended in adhesion buffer, and incubated with viruses as described
above in the Examples section. As expected, all adenoviral serotypes tested are efficiently
attached to and internalized into HeLa cells (Table III) (Figure 13). Adenovirus serotypes
3, 5, 35, and 41, but not 9, are efficiently attached to and internalized by 293 cells. In
contrast, poor attachment and internalization of most adenovirus serotypes are observed
with CHO cells. The level of attachment on CHO is about 50-70 virus particles per cell for
adenovirus serotypes 5 and 41, 115 virus particles per cell for adenovirus type 3, and
about 180 particles per cell for adenovirus serotypes 9 and 35. For further analysis,
numbers >300 viral particles per cell are assumed as positive and <70 viral particles per
cell as negative in terms of susceptibility of a particular cell line for efficient adenoviral
transduction.
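
The cutoffs stated above can be written as a simple classification rule. In this sketch the two thresholds come from the text, while the function name and the label "indeterminate" for values between the cutoffs are assumptions, since the text does not name that middle range.

```python
POSITIVE_THRESHOLD = 300  # > 300 particles per cell: assumed positive
NEGATIVE_THRESHOLD = 70   # < 70 particles per cell: assumed negative


def classify_susceptibility(particles_per_cell):
    """Classify a cell line by the number of viral particles per cell."""
    if particles_per_cell > POSITIVE_THRESHOLD:
        return "positive"
    if particles_per_cell < NEGATIVE_THRESHOLD:
        return "negative"
    # The text gives no label for 70-300; "indeterminate" is an assumption.
    return "indeterminate"


if __name__ == "__main__":
    # CHO attachment values quoted above for Ad3, Ad9 and Ad35.
    for serotype, value in [("Ad3", 115), ("Ad9", 180), ("Ad35", 180)]:
        print(serotype, value, classify_susceptibility(value))
```

Under this rule the CHO values quoted for Ad3, Ad9 and Ad35 fall between the two cutoffs.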



TABLE III

Comparative analysis of attachment and internalization of Ad5 and Ad9
to cell lines expressing different amounts of CAR and αv-integrins.

Cell line      CAR          αv-integrin   Ad9 (attached/   Ad5 (attached/
               expression   expression    internalized)    internalized)
HeLa           ++           ++            426/370          550/500
CHO                         ++            300/300          70/50
293            ++           ++            20/20            1950/1750
Y79            +++                        190/140          1200/1100
K562                        +             320/230          60/50
Erythrocytes   ?            ?             420/-            68/-

Attachment and internalization of adenovirus serotypes 3, 5, 9, 35 and 41 into human
CD34+ bone marrow cells and K562 erythroleukemia cell line:

Previous studies showed that the human erythroleukemia cell line K562 can be
transduced with Ad5-based adenoviral vectors at very high MOIs. As shown in Figure
14, only about 60 viral particles per cell of adenovirus serotype 5 are attached to, and even
fewer particles are internalized into, these cells at a MOI of 400. In contrast to Ad5, about
320 viral particles per cell of Ad9 and about 1500 viral particles per cell of Ad35 are
attached to K562 cells, and about two-thirds of them are internalized (Figure 14B).
Human unstimulated CD34+-enriched bone marrow cells obtained from frozen stocks are
incubated overnight in growth medium without cytokine stimulation. The next day, the
number of viable cells is calculated. For attachment studies, cells are washed three times
with ice-cold PBS, resuspended in adhesion buffer and incubated with adenoviruses.
Among the adenoviral serotypes tested, only particles of Ad9 (about 150 viral
particles per cell) and Ad35 (about 320 viral particles per cell) are able to attach to
unstimulated CD34+ cells at a level clearly above that of Ad5 (only 60 viral particles per
cell). Four-fifths of these virus particles are able to be internalized by the cells.
Interestingly, upon stimulation of CD34+ cells with GM-CSF and EPO/TPO for two
weeks, attachment and internalization of Ad9 viral particles are significantly increased
(up to 300 particles per cell). At the same time, the transient stimulation of cells with
GM-CSF for two days could not increase the level of viral attachment to the cells.

Based on the above finding that the Ad35 serotype is able to attach to and internalize into
CD34+ cells most efficiently among the serotypes tested, serotype Ad35 was selected
for further studies. As described in Appendix II, a chimeric vector (Ad5 GFP/F35)
containing the short-shafted Ad35 fiber sequence in an Ad5 capsid was able to target a
broad spectrum of CD34+ cells in a CAR/integrin independent manner.

DISCUSSION 

In summary, from all the serotypes tested, Ad9, Ad3, and Ad35 demonstrated the most
efficient attachment to and internalization with K562 and CD34+ cells. Based on
adsorption/internalization data, Ad9 and Ad35 as representatives for subgroups D and B
were selected for further tropism studies.

D. Characterization of Ad vector replication in K562 and CD34+ cells. 

Comparative analysis of the ability of Ad5, Ad9, and Ad34 to infect and to replicate in
293, K562 and CD34+ cells. The ability of the Ad9 fiber knob domain to recognize the same
primary receptor on the cell surface as Ad5 with comparable affinity was described
earlier. Thus, the finding that Ad9 viral particles can only poorly attach to 293 cells is
rather unexpected. In order to find out how the attachment and internalization data
reflect the biological activity of adenoviruses of different serotypes, the stocks of Ad5,
Ad9 and Ad35 are characterized in more detail by electron microscopy, plaque assay on
293 cells, and quantitative replication assay in K562 and CD34+ cells.

METHODS 
10 Quantitative replication assay: 

lxl 0 5 CD34+ or K562 cells were infected in \ 00\x\ of growth media with different MOIs 
of Ad5, 9 ? or 35 which had been amplified in 293 cells, expressing the Xhol DNA 
methyltransferase isoshizomer PaeR7 (Nelson, J., Kay r M.A. 1997. Journal of Virology. 

15 71 :8902-8907). After 2 hours of incubation at 37°C, the cells were centrifuged at 1000 x 
g for 5 min, the virus-containing medium was removed, the cells were resuspended in 
100^1 of fresh media, and then they were incubated at 37°C until harvesting. At 16 hours 
post-infection for K562 cells, or 36 h post-infection for CD34+ cells, 5 \x% of pBS 
(Stratagene, La Jolla, CA) plasmid DNA was added as a carrier which could also be used 

20 as a loading control. Genomic DNA was extracted as described previously (Lieber, A., 
C.-Y. et al. 1996. Journal of Virology. 70:8944-8960). One-fourth of purified cellular 
DNA (equivalent to 2.5 x 10 4 cells) was digested with Hindlll, Xhol, or with Hindlll and 
Xhol together at 37°C overnight and subsequently separated in a 1% agarose gel followed 
by Southern blot with chimeric Ad5/9 or Ad5/35 DNA probes. The chimeric probes, 

25 containing sequences of Ad5 and Ad9 (Ad 5/9) or Ad5 and Ad35 (Ad 5/35), were 
generated by a two-step PCR amplification using Pfu-Turbo DNA polymerase 
(Stratagene, La Jolla, CA) and viral DNA from purified particles as a template. The 
following primers were used for PCR (for the chimeric primers, the Ad5 nucleotide
numbers are listed first): Ad5F1 - (nt: 32775-32805) 5'-GCC CAA GAA TAA AGA ATC GTT
TGT GTT ATG-3'; Ad5R1 - (nt: 33651-33621) 5'-AGC TGG TCT AGA ATG GTG
GTG GAT GGC GCC A-3'; chimeric Ad5/9F - (nt: 31150-31177, nt: 181-208) 5'-AAT
GGG TTT CAA GAG AGT CCC CCT GGA GTC CTG TCA CTC AAA CTA GCT
GAC CCA-3'; chimeric Ad5/9R - (nt: 32805-32775, nt: 1149-1113) 5'-CAT AAC
ACA AAC GAT TCT TTA TTC TTG GGC TTC ATT CTT GGG CGA TAT AGG
AAA AGG-3'; chimeric Ad5/35F - (nt: 31150-31177, nt: 132-159) 5'-AAT GGG TTT
CAA GAG AGT CCC CCT GGA GTT CTT ACT TTA AAA TGT TTA ACC CCA-3';
chimeric Ad5/35R - (nt: 32805-32775, nt: 991-958) 5'-CAT AAC ACA AAC GAT TCT
TTA TTC TTG GGC ATT TTA GTT GTC GTC TTC TGT AAT GTA AG-3'.
Nucleotide numbers are given according to the sequences obtained from the NCBI
GenBank (accession No. M73260 / M29978 for Ad5, X74659 for Ad9, and U10272 for
Ad35). After the first amplification, the 968 bp-long Ad9 and 859 bp-long Ad35 DNA
fragments corresponding to the fiber genes, and an 876 bp-long Ad5 fragment
corresponding to the Ad5 E4 region (located immediately downstream of the Ad5 fiber gene),
were purified by agarose gel electrophoresis. To generate chimeric DNA probes,
amplified Ad5 DNA was mixed with Ad9 or Ad35 fragments obtained during the first
step of PCR and subjected to a second PCR amplification using Ad5/9F or Ad5/35F
primers and the Ad5R1 primer. The resulting Ad5/9 or Ad5/35 chimeric DNA fragments
(see Fig. 15) were purified and their concentrations were measured
spectrophotometrically. Corresponding chimeric DNA fragments were loaded as
concentration standards on agarose gels or labeled with [32P]-dCTP and used as probes
for Southern analysis. The number of viral genomes per DNA sample was calculated
after quantitative Phospho-imager analysis. In preliminary experiments, no preferential
hybridization of chimeric DNA probes to DNA of any particular viral serotype was
detected.
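
The overlap that makes the second PCR step possible can be checked directly from the primer sequences listed above. The following Python sketch confirms that the 5' end of each chimeric reverse primer is the reverse complement of primer Ad5F1, so that the first-round fiber fragment and the Ad5 E4 fragment carry complementary ends. The helper names are illustrative only and are not part of the original protocol.

```python
# Primer sequences copied from the text, with the spacing removed.
AD5F1 = "GCCCAAGAATAAAGAATCGTTTGTGTTATG"
AD5_9R = "CATAACACAAACGATTCTTTATTCTTGGGCTTCATTCTTGGGCGATATAGGAAAAGG"
AD5_35R = "CATAACACAAACGATTCTTTATTCTTGGGCATTTTAGTTGTCGTCTTCTGTAATGTAAG"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def shares_overlap(chimeric_reverse: str, forward: str) -> bool:
    """True if the chimeric reverse primer begins with the reverse
    complement of the forward primer (the overlap used for fusion)."""
    return chimeric_reverse.startswith(reverse_complement(forward))


if __name__ == "__main__":
    for name, primer in (("Ad5/9R", AD5_9R), ("Ad5/35R", AD5_35R)):
        print(name, "overlaps Ad5F1:", shares_overlap(primer, AD5F1))
```

Both chimeric reverse primers pass this check, which is consistent with the fusion of the serotype-specific fiber fragment to the Ad5 E4 fragment in the second amplification.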

RESULTS

Replication of selected serotypes in K562 and CD34+ cells:

Adsorption/internalization studies do not ultimately prove viral transduction, a process
often defined as gene transfer that allows for viral or heterologous gene expression in
host cells. Intracellular trafficking, including endosomal lysis, transport to the nucleus,
and nuclear import of the viral genome, depends on structural capsid proteins and thus
varies between different serotypes (Defer, C., et al. 1990. J. Virology. 64:3661-3673;
Miyazawa, et al. 1999. J. Virology. 73:6056-6065). We believed that analysis of viral
gene expression would be a means to verify successful nuclear import of viral genomes
and that this would be a good criterion for selecting serotype/s able to efficiently infect
our target cells. To do this, we used a protocol that allows for the detection of Ad
replication in infected cells. Viral DNA synthesis can only occur after de novo expression
of adenoviral early genes. We utilized a site-specific methylation strategy to monitor viral
DNA replication within infected cells (Nelson, J., Kay, M.A. 1997. Journal of Virology.
71:8902-8907). Methylated Ad serotypes were produced by the addition of a methyl
group onto the N6 position of the adenine base of Xho I sites, CTCGAG, during
propagation of the viruses in 293 cells expressing the Xho I isoschizomer PaeR7
methyltransferase (PMT) (Kwoh, T. J., et al. 1986. Proc. Natl. Acad. Sci. USA. 83:7713-
7717) (293 PMT cells). Loss of methylation through viral replication restores Xho I
cleavage and can be detected by Southern blots of Xho I-digested genomic DNA from
infected cells.

Ad replication studies were performed in K562 and CD34+ cells with Ad9 and Ad35, in
comparison to Ad5. For replication studies, the infectious titer (in pfu/ml) and genome
titer (in genomes per ml) were determined (by plaque assay on 293 cells or by
quantitative Southern blot, respectively) for methylated and unmethylated Ad5, Ad9, and
Ad35 (Table 2). The ratio of pfu to genome titer was comparable for methylated and
unmethylated virus, demonstrating that DNA methylation had not altered transduction
properties. About 85% of the virus (Ad5, 9, and 35) used for infection was methylated, as
calculated from the intensity of fragments specific for methylated and non-methylated
viral DNA present in the viral load (Fig. 15). The numbers of genomes
detected after adsorption (1 hour, 0°C) or internalization (2 hours, 37°C) correlated well
with studies shown in Fig. 12. Ad9 and Ad35 interacted more efficiently than Ad5 with
K562 and CD34+ cells. Dose-dependent replication studies in K562 and CD34+ cells
were performed with the same genome numbers of Ad5, 9, and 35 (Fig. 15). The
replication rate was measured based on the ratio of methylated to demethylated viral
DNA after infection with different MOIs (2100, 420, and 105 genomes per cell). In K562
cells, efficient replication (100% conversion from methylated to unmethylated DNA) was
detected for Ad5 at MOI >/= 2100, for Ad9 at MOI >/= 420, and for Ad35 at MOI
>/= 105. This demonstrated that Ad35 transduced K562 cells with the highest efficiency.
In CD34+ cells, the replication rate was 100% for Ad5 and 31% for Ad9 after infection
with MOI 420. Although methylated Ad35 viral DNA was present in CD34+ cells, viral
replication was undetectable for Ad35. In summary, while viral replication studies in
K562 cells confirmed data obtained for Ad5, 9, and 35 adsorption and internalization,
there was a discrepancy between earlier results and the poor replication of Ad9 and,
particularly, Ad35 in CD34+ cells. As outlined later, replication analysis in
heterogeneous cell populations, like CD34+ cells, may not allow for definitive
conclusions on tropism of a particular serotype.
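
The readout of this assay reduces to a ratio of band intensities. As a minimal sketch, assuming that the Xho I-resistant (methylated, input) and Xho I-cleaved (demethylated, replicated) bands have already been quantified by Phospho-imager, the percentage conversion could be computed as shown below. The function name and the intensity values are hypothetical and serve only to illustrate the arithmetic; they are not data from the experiments described here.

```python
def percent_demethylated(methylated_signal: float, demethylated_signal: float) -> float:
    """Share of viral DNA that has lost Xho I-site methylation, in percent."""
    total = methylated_signal + demethylated_signal
    if total <= 0:
        raise ValueError("no viral DNA signal to quantify")
    return 100.0 * demethylated_signal / total


# Hypothetical band intensities in arbitrary Phospho-imager units.
input_virus = percent_demethylated(methylated_signal=850.0, demethylated_signal=150.0)
after_infection = percent_demethylated(methylated_signal=690.0, demethylated_signal=310.0)

if __name__ == "__main__":
    print(f"input virus: {input_virus:.0f}% demethylated")
    print(f"after infection: {after_infection:.0f}% demethylated")
```

Because the input virus is itself not fully methylated (about 85% in the text), the value measured for the viral load sets the background against which any increase after infection has to be read.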

Taking all the screening data together, Ad9 and Ad35 emerged as the variants with the
strongest tropism for K562 and CD34+ cells. It is thought that Ad9 can bind to CAR;
however, it preferentially uses αv-integrins for cell entry (Roelvink, P. W., et al. 1996. J.
Virology. 70:7614-7621). This entry strategy may not be optimal for efficient infection of
CD34+ cells, as fewer than 17% of them express αv-integrins (Fig. 10). Therefore, we
decided to concentrate on Ad35 as a source for the heterologous fiber to be used for
construction of a chimeric vector based on an Ad5 backbone.

TABLE IV

Results from the infectivity assay which determines the optical
particle-to-PFU (OPU/PFU) ratio using 293 cells

Virus    OPU (A260)      PFU             OPU/PFU ratio
Ad5      1.4 x 10^12     1.06 x 10^11    13
Ad9      4.61 x 10^11    2.6 x 10^8      1773
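
The ratios in the last column of Table IV follow directly from the two titers. The short Python check below reproduces them; the exponents used are the ones consistent with the reported ratios, and the dictionary layout and function name are illustrative only.

```python
# Titers as read from Table IV (optical particle units and plaque-forming units).
titers = {
    "Ad5": {"opu": 1.4e12, "pfu": 1.06e11},
    "Ad9": {"opu": 4.61e11, "pfu": 2.6e8},
}


def opu_per_pfu(opu: float, pfu: float) -> float:
    """Optical particles per plaque-forming unit."""
    if pfu <= 0:
        raise ValueError("PFU titer must be positive")
    return opu / pfu


# Rounded to whole numbers, as in the table.
ratios = {virus: round(opu_per_pfu(**t)) for virus, t in titers.items()}

if __name__ == "__main__":
    for virus, ratio in ratios.items():
        print(f"{virus}: OPU/PFU = {ratio}")
```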



DISCUSSION

Viral replication studies in K562 cells confirmed the data obtained for Ad5, 9, and 35
adsorption and internalization. However, there was a discrepancy between the interaction
data and the replication data in CD34+ cells, where Ad9 replicated only poorly and no
replication was seen for Ad35. Ad replication is only initiated upon the production of a
critical threshold of early viral proteins, which, in turn, is directly dependent on the
number of viral genomes present in the nuclei of infected cells. Therefore, the outcome of
replication studies may be affected by the rate of nuclear import of viral genomes, by the
activity of viral promoters, and/or by the intracellular stability of viral DNA/RNA. These
parameters may vary, on the one hand, between different subsets of CD34+ cells and, on the
other hand, between different Ad serotypes. In conclusion, the viral replication analyses
performed with different Ad serotypes in CD34+ cells may not predict the actual
transduction properties of chimeric vectors based on an Ad5 backbone. This implies that
attempts to produce gene transfer vectors based on Ad genomes other than Ad5 should be
undertaken with caution.

Recently, an Ad serotype screening strategy was used to identify variants with tropism
for primary fetal rat CNS cortex cells or human umbilical vein endothelial cells. The
optimal serotype (Ad17) was selected based on immunohistochemistry for hexon
production 48 hours after infection (Chillon, M., et al. 1999. J. Virology. 73:2537-2540).
However, this approach is problematic because, at least in our hands, antibodies
developed against Ad5 hexon did not cross-react with other serotypes. Also, hexon is
expressed only after onset of replication. As outlined above, the kinetics of intracellular
trafficking, viral gene expression, and replication vary significantly between serotypes
(Defer, C., et al. 1990. J. Virology. 64:3661-3673; Miyazawa, et al. 1999. J. Virology.
73:6056-6065).

In addition to being the most efficient serotype in terms of interaction with CD34+ cells,
Ad35 is also interesting because it interacts with receptor/s different from those of Ad5 and
Ad3. Ad35 and Ad5GFP/F35 attachment was not inhibited by Ad5 or anti-CAR
antibodies, suggesting that Ad35 binding was CAR-independent. First, Ad5 did not
compete with Ad35 and Ad5GFP/F35 during internalization and infection, indicating that
αvβ3/5 integrins are not involved in viral entry. Second, function-blocking antibodies
against αv-integrins did not compete with Ad35 and Ad5GFP/F35 for internalization into
K562 cells, whereas these antibodies did inhibit Ad5 internalization. And third, in
contrast to Ad5-based vectors, GFP expression after infection with Ad5GFP/F35 was not
restricted to αv-integrin-expressing CD34+ cells. From these facts, we conclude that
infection with Ad35 and the chimeric Ad5GFP/F35 vector does not involve αv-integrins.
In this context, the presence or absence of RGD motifs within the Ad35 penton base remains
to be determined by sequencing the corresponding genome region. Cross-competition
assays demonstrated that Ad35 and Ad5GFP/F35 bind to a receptor that is different from
the Ad3 receptor. Although Ad3 and Ad35 belong to the same subgroup, they have been
divided into two DNA homology clusters, B1 and B2; the amino acids composing their
fibers are only 60% homologous. Furthermore, the target tissues for both viruses are
different; Ad3 can cause acute respiratory infections, whereas Ad35 is associated with
kidney infection (Horwitz, M. S. 1996. Adenoviruses, p. 2149-2171. In B. N. Fields,
Knipe, D.M., Howley, P.M. (ed.), Virology, vol. 2. Lippincott-Raven Publishers Inc.,
Philadelphia). Therefore, it was not surprising to see that Ad3 and Ad35 recognize
different receptors.

In conclusion, Ad35 and the chimeric vector enter the cells by a CAR- and αv-integrin-
independent pathway. We believe that Ad35 and the chimeric vector bind primarily to
their fiber receptor and that this interaction is sufficient to trigger internalization. On the
other hand, Ad35 internalization may involve cellular proteins other than αv-integrins.
These membrane proteins can overlap with those for Ad3 internalization and represent β2
integrins, which protrude more from the cell surface than αv-integrins (Huang, S., et al.
1996. J. Virology. 70:4502-4508).

According to EM studies of negative contrast-stained adenoviral suspensions, the
percentage of deficient particles for all adenoviral serotypes tested does not exceed 5%.
However, plaque assays reveal that the ability to form plaques in 293 cells differs
significantly among the tested serotypes. The optical particle-to-PFU (OPU/PFU) ratio
obtained is 13 for Ad5, which is in good agreement with the previously estimated ratio for this
adenoviral serotype. Importantly, this ratio is about three times higher for adenovirus
serotype 35 and more than 150-fold higher for adenovirus serotype 9. Furthermore,
quantitative Southern blot using chimeric Ad5/9 and Ad5/35 DNA probes is used to
determine the ratio between the genome and transducing titer. This study confirms the
data obtained by plaque assay. Quantitative replication assay of these adenoviruses in
K562 and CD34+ cells also confirms the ability of Ad9 and Ad35 to attach more efficiently
to these cell types. The replication of viral genomes is observed for Ad9 and Ad35
at lower MOIs of infection, compared to Ad5. In conclusion, the data obtained for
different serotypes in attachment and internalization are in good agreement with the
infectivity data in target cells.

E. Attachment and internalization of different adenoviral serotypes into primary
dendritic cells, JAWSII, MCF-7 and REVC cells.

As a proof of principle, the serotype screening strategy can be employed for other
important target cells which are refractory to Ad5 infection.

RESULTS 

REVC cells are endothelial cells, which have to be targeted for approaches that are aimed
at gene therapy of restenosis, atherosclerosis, inflammation, etc. MCF-7 cells are breast
cancer cells isolated from liver metastases, which are important targets for tumor gene
therapy. The human adenovirus serotypes 3, 5, 9, 35 and 41 are tested to see whether
they can attach to and be internalized by mouse primary dendritic cells, JAWSII
cells, MCF-7 human breast cancer cells and REVC endothelial cells. None of the
adenoviral serotypes tested can efficiently attach to primary dendritic cells. Adenovirus
serotype 3 is able to efficiently attach to REVC endothelial cells (about 400 virus
particles per cell are attached and about 300 are internalized). In comparison, only 50
Ad5 particles are able to attach to, and even fewer are internalized in, these REVC cells. The
human breast cancer cells (MCF-7) were previously shown to be refractory to Ad5
infection at low MOIs. However, Ad3 and, more efficiently, Ad35 attach to and
internalize into MCF-7 cells.

DISCUSSION 



The data presented herein indicate that different human adenovirus serotypes recognize
different cellular receptors and can therefore infect cell types that are refractory to Ad5
infection. There are adenoviral serotypes that attach to and internalize into human CD34+,
REVC, K562 and MCF-7 cells more efficiently than Ad5. This finding provides
a basis for the construction of chimeric adenoviral vectors, which are Ad5 vectors
containing receptor ligands derived from other serotypes.

F. Infection studies on primary human bone marrow cells. 

Since established erythroleukemic cell lines do not represent an adequate model for the
ultimate hematopoietic stem cell that has to be targeted in patients in order to achieve
long-term reconstitution with genetically modified cells, normal primary human bone
marrow cells are used for the initial infection/retargeting studies.

RESULTS

In a first set of tropism studies with different Ad serotypes, whole bone marrow cell
suspensions can be used without preselection. This is advantageous because the tropism
of various adenovirus serotypes or genetically retargeted vectors can be analyzed on a
broad spectrum of progenitor subpopulations representing myeloid, erythroid,
megakaryocytic, lymphoid, dendritic, and monocytic lineages. For short-term (< 5 hours)
infection studies, bone marrow suspensions can be cultured in IMDM supplemented with
10% FCS, β-mercaptoethanol, and 10 units/ml IL-3 to ensure cell viability.



Mononucleated cell assays:

Mononucleated bone marrow cells can be incubated with MOI 1, 10, 100, or 1000
pfu/cell of the various adenovirus types for a short time. Paraffin sections or cytospins of
infected bone marrow cells can be analyzed for nuclear-localized, labeled viral DNA.
BrdU labeling can be visualized by immunofluorescence with anti-BrdU antibodies; 32P-
tagged viral DNA can be detected by incubation with photo-emulsion. In addition, the
same cell material can be analyzed for morphology after specific histo-staining (e.g.
Wright-, Hemo3 staining). If required, commercially available antibodies to
specific cell surface markers, conjugated directly to different fluorochromes (FITC
(green), TRITC, RPE (red), RPE-Cy5, AMCA (blue)), can be used to completely
characterize infected bone marrow subpopulations. Colocalization of BrdU-labeled viral
DNA (e.g. as FITC signal) with membrane markers signifying infection of specific cell
types can be demonstrated; for example, potential stem cells/early progenitors (CD34+, CD38-),
megakaryocytes (CD41a+), erythroid cells (glycophorin A+), dendritic cells (CD1a+),
monocytes (CD14+), or myeloid cells (CD15+), etc. The morphological analysis of
infected bone marrow subsets gives a first indication of whether specific adenovirus
serotypes can target primitive cell types.
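
Setting up such an MOI series requires converting each MOI into a volume of virus stock. A minimal sketch of that arithmetic in Python follows, assuming a hypothetical stock titer and cell number; neither value is taken from the experiments described here, and the function name is illustrative.

```python
def stock_volume_ul(moi: float, cell_count: float, titer_pfu_per_ml: float) -> float:
    """Volume of virus stock (in microliters) that delivers `moi` pfu per cell."""
    if titer_pfu_per_ml <= 0:
        raise ValueError("titer must be positive")
    total_pfu = moi * cell_count
    return total_pfu / titer_pfu_per_ml * 1000.0  # convert ml to microliters


HYPOTHETICAL_TITER = 1e10  # pfu/ml, illustrative only
HYPOTHETICAL_CELLS = 1e5   # cells per sample, illustrative only

volumes = {
    moi: stock_volume_ul(moi, HYPOTHETICAL_CELLS, HYPOTHETICAL_TITER)
    for moi in (1, 10, 100, 1000)
}

if __name__ == "__main__":
    for moi, volume in volumes.items():
        print(f"MOI {moi}: {volume:g} microliters of stock")
```

With these example numbers the lowest MOI corresponds to a sub-microliter volume, so in practice the low-MOI points would be taken from a pre-diluted stock.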

DISCUSSION

Since the different wild-type adenoviruses do not express a uniform marker gene and do
not integrate, and since detection of tagged viral DNA cannot be done on live cells, it is
not possible, at this point, to characterize infected cells for clonogenic or repopulation
capacities. Therefore, adenovirus serotypes for retargeting studies are selected based on
their ability to infect in vitro purified CD34+ cells at low MOIs. This subset of bone
marrow cells is known to contain long-term reconstituting cells. Infection studies with
different adenovirus serotypes can be repeated on purified CD34+ cells (cultured in
IMDM + 10% FCS, β-mercaptoethanol, and 10 units/ml IL-3) as described above.
Purification of CD34+ cells can be performed by direct immunoadherence on anti-CD34
monoclonal antibody-coated plates or on MiniMacs columns as described by
Papayannopoulou (Papayannopoulou, T. et al., 1996, Experimental Hematology, 24:660-
69; Papayannopoulou, T. et al., 1993, Blood, 81:229). The purity of isolated CD34+
cells ranges routinely from 80-95%. Analogous infection studies can be repeated with
selected adenovirus types on CD34+/CD38- subsets.

To confirm productive infection, purified CD34+ cells can be infected with selected
(methylase-tagged) serotypes and analyzed for viral DNA replication. Cultures of purified
human bone marrow CD34+ cells can be used for the transduction and integration studies
as a model for HSCs.

It was recently demonstrated that HSC activity does exist in CD34-negative human bone
marrow subsets (Bhatia, M. et al., 1998, Nature Medicine, 4:1038-45; Osawa, M., et al.,
1996, Science, 273:242-5; Goodell, M. et al., 1997, Nature Medicine, 3:1337-45; Zanjani,
E. D. et al., 1998, Exp. Hematology, 26:353-60). Lin-CD34-38- cells can be tested in the
retargeting and transduction studies in combination with repopulation assays in SCID-
NOD mice.

G. Cloning and insertion of the fiber gene. 

METHODS

PCR-cloning of the corresponding fiber gene and insertion into Ad5-based shuttle
plasmids instead of the endogenous Ad5 fiber:

One or several adenoviruses with tropism to CD34+ or other HSC-containing populations
are selected for the further studies described herein. The complete coding region for fiber
varies between 1 and 2 kb, depending on the virus type. The fiber-encoding sequences can be
obtained by PCR with Pfu polymerase from viral DNA isolated from purified particles of
the selected virus types. The corresponding primers can be designed based on the fiber
sequences available from the EMBL gene bank. The PCR products are cloned as PacI-
BalI fragments into pCD4 (Figure 10), a shuttle vector for recombination in RecA+ E.
coli. In pCD4, the heterologous fiber gene is flanked on both sides with Ad5 sequences,
which are homologous to regions directly adjacent to the fiber reading frame in Ad5. As
an Ad5 (shuttle vector) derived template for recombination, pCD1, a pBHG10
(Microbix, Toronto, Canada) derivative, can be used. The recombination procedure is
performed according to a protocol routinely used for recombinant adenovirus generation
(Chartier, C., et al., 1996, J. of Virology, 70, 4805-4810). Routinely, 90% of the resulting
plasmids are accurately recombined. The junctions between the heterologous fiber (X)
and Ad5 sequences can be sequenced to confirm the accuracy of recombination. The
resulting plasmid is named pAd5fiberX (pAd5fx). The resulting product is used to
generate pAd5fx-based Ad.AAV containing the heterologous fiber gene.

Construction of chimeric Ad vectors: 

For transduction studies, two Ad vectors were constructed: Ad5GFP and Ad5GFP/F35,
containing a chimeric Ad5/35 fiber gene. Both adenoviral vectors contained a 2.3 kb,
CMV promoter-driven EGFP gene [derived from pEGFP-1 (Clontech, Palo Alto, CA)]
inserted into the E3 region of Ad5. The EGFP expression cassette was cloned between
Ad5 sequences 25,191-28,191 and 30,818-32,507 into a shuttle plasmid, which contained
the E3 deletion described for pBHG10 (Microbix, Toronto, Canada). The resulting
plasmid was named pAdGFP. For the chimeric vector, the Ad5 fiber gene in pAdGFP
was substituted by an Ad5/35 chimeric fiber gene generated by the two-step PCR
protocol outlined above. In the first PCR step, three DNA fragments corresponding to i)
the Ad5 fiber 5'-nontranslated region and the first 132 bp of the fiber tail domain (nt
30,818-31,174), ii) the Ad35 shaft and knob domains (nt 132-991), and iii) the Ad5 E4
region including the Ad5 fiber polyadenylation signal (nt 32,775-33,651) were amplified
by Pfu-Turbo DNA polymerase. The following primers were used: for the Ad5 tail,
Ad5F-2 (nt 30,798-30,825) 5'-CGC GAT ATC GAT TGG ATC CAT TAA CTA-3' and
Ad5R-2 (nt 31,174-31,153) 5'-CAG GGG GAC TCT CTT GAA ACC CAT T-3'; for the
Ad35 shaft and knob, primers Ad5/35F and Ad5/35R (see above); for the Ad5 E4 and

polyA, primers Ad5F-1 and Ad5R-1 (see above). After 10 PCR cycles, the products were
purified by agarose gel electrophoresis, combined, and then subjected to a second PCR
with primers Ad5F-2 and Ad5R-1. The resulting 2115 bp-long chimeric fiber gene
contained the Ad5 tail and the Ad35 shaft and knob domains. This product was used as a
substitute for the SalI/XbaI Ad5 fiber gene-containing fragment in pAdGFP. The
resulting plasmid was named pAdGFP/F35. To generate full-length E1/E3-deleted vector
genomes, pAdGFP and pAdGFP/F35 were inserted in pAdHM4 (Mizuguchi, H., Kay,
M.A. 1998. Human Gene Therapy. 9:2577-2583) by recombination in E. coli (Chartier,
C. et al. 1996. Journal of Virology. 70:4805-4810). To do this, the RecA+ E. coli
strain BJ5183 was co-transformed with pAdHM4 linearized by SrfI mixed with the XbaI
fragments containing the GFP genes, the Ad5 or Ad5/35 fiber genes, and the Ad5
homology regions. The resulting recombinants were analyzed by restriction analysis.
Correct recombinants were amplified in E. coli HB101 and purified by double CsCl
gradient banding. The plasmids were named pAd5GFP and pAd5GFP/F35. The correct
structure of the Ad5/35 chimeric fiber gene was confirmed by endonuclease digestion and
sequencing part of pAd5GFP/F35. To produce the corresponding viruses, pAd5GFP and
pAd5GFP/F35 were digested with PacI to release the viral genomes and transfected into
293 cells as described (Lieber, A., C.-Y. et al. 1996. Journal of Virology. 70:8944-8960).
Plaques developed 7 to 10 days post-transfection in overlaid cultures. Recombinant
viruses were propagated in 293 cells and purified by standard methods described
elsewhere (Lieber, A., C.-Y. et al. 1996. Journal of Virology. 70:8944-8960).



Hemagglutination assay:

Twenty-five microliters of serial dilutions of Ad5, Ad35, or chimeric Ad5GFP/F35
virions in McIlvaine-NaCl buffer (0.1 M citric acid, 0.2 M Na2HPO4 [pH 7.2], diluted
1:50 with 0.87% NaCl) were loaded onto 96-well plates. To each dilution, 25 µl of a 1%
suspension of monkey erythrocytes (in McIlvaine-NaCl buffer) was added. The
sedimentation pattern was determined after incubation for 1 hour at 37°C. All tests were
performed in quadruplicate in at least two independent experiments.
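
A serial dilution plate of this kind is conveniently described by its reciprocal dilution factors, and the HA titer is the highest dilution that still agglutinates. The sketch below assumes a two-fold series (the dilution step is not stated in the text) and uses a hypothetical agglutination readout; the function names are illustrative only.

```python
from typing import List, Optional


def dilution_series(step: int, wells: int) -> List[int]:
    """Reciprocal dilutions of a serial dilution, e.g. 2, 4, 8, ... for step=2."""
    return [step ** i for i in range(1, wells + 1)]


def ha_titer(dilutions: List[int], agglutinated: List[bool]) -> Optional[int]:
    """Highest reciprocal dilution that still shows agglutination, or None."""
    positive = [d for d, hit in zip(dilutions, agglutinated) if hit]
    return max(positive) if positive else None


series = dilution_series(step=2, wells=10)  # 1:2 through 1:1024 (assumed two-fold steps)
readout = [True] * 9 + [False]              # hypothetical plate readout
titer = ha_titer(series, readout)

if __name__ == "__main__":
    print("dilutions:", series)
    print("HA titer: 1:%s" % titer)
```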



Southern blot:

Extraction of genomic DNA, labeling of DNA fragments and hybridization were
performed as described earlier (Lieber, A., C.-Y. et al. 1996. Journal of Virology.
70:8944-8960).

RESULTS 

Construction/Characterization of chimeric fiber: 

Previously, it has been shown that exchanging the fiber knob was sufficient to alter the
tropism of chimeric Ad vectors (Chillon, M., et al. 1999. J. Virology. 73:2537-2540;
Krasnykh, V., et al. 1998. J. Virology. 72:1844-1852; Stevenson, S. C., et al. 1997.
J. Virology. 71:4782-4790). As outlined above, the length of the fiber shaft may critically
determine the entry strategy of a particular serotype. Therefore, we decided to replace not
only the Ad5 fiber knob but also the shaft. The chimeric Ad5/35 fiber contained the Ad5
tail (amino acids 1-44) necessary for interaction with the Ad5 penton base, linked to 279
amino acids from Ad35 including the shaft with 7 β-sheets and the knob (Fig. 16A). The
endogenous Ad5 fiber polyA signal was used to terminate transcription of the chimeric
fiber RNA. The combination of the Ad5 capsid, including the RGD motif-containing
penton base, with a short-shafted fiber could be risky because the natural distance
between the fiber knob and the RGD motifs was disturbed. The Ad5 fiber was
substituted by the chimeric fiber sequences based on an E1/E3-deleted Ad vector. This
vector carried a CMV promoter-GFP reporter gene cassette inserted into the E3 region.
The corresponding chimeric virus (Ad5GFP/F35) was produced in 293 cells at a titer of
>2 x 10^12 genomes per ml. For comparison, an E1/E3-deleted Ad vector containing the
original Ad5 fiber gene and the GFP expression cassette was generated (Ad5GFP). The
titer and the ratio of physical to infectious particles were similar between Ad5GFP and
Ad5GFP/F35, indicating that the fiber modification did not significantly alter the stability
and/or growth properties of the chimeric vector. The correctness of the fiber modification
was confirmed by restriction analysis of the Ad5GFP/F35 viral genome followed by
Southern blot hybridization (Fig. 16B), direct sequencing of the fiber-coding region, and
a functional test for hemagglutination (HA) of monkey erythrocytes. The agglutination of
erythrocytes is fiber knob-mediated; it is known that Ad5 does not agglutinate monkey
erythrocytes whereas Ad35 efficiently does (Pring-Akerblom, P., et al. 1998. J. Virology.
72:2297-2304). In HA tests, Ad5GFP/F35 agglutinated monkey erythrocytes with the
same efficiency as Ad35 at dilutions of up to 1:512. In contrast, no hemagglutination was
observed with equivalent Ad5 dilutions. This clearly confirmed the functional activity of
the chimeric Ad5/35 fiber incorporated into the Ad5 capsid.

Generation of chimeric adenoviral vectors (Ad.AAVfx) with heterologous fiber
molecules: Adenoviruses with chimeric Ad5-Ad3 fiber are viable and can be produced at
high titers (Krasnykh, V., et al., 1996, J. of Virology, 70, 6839-6846; Stevenson, S. C. et
al., 1997, J. Virology, 71:4782-90). In order to test whether the fiber substitution
described herein affects production or stability of adenoviruses, two E1-deleted first-
generation adenoviral vectors are produced with the AAV-βgal cassette in 293 cells
using standard protocols. One vector is generated by recombination of pAd.AAV-BG
(Fig. 17) with pCD1 (containing the endogenous Ad5 fiber); the other vector (with
heterologous fiber) is the recombination product of pAd.AAVβGal and pAd5fiberX
(pAd5fx). Virus from single plaques is amplified on 293 cells. The production yield per
293 cell can be determined by plaque-titering of 293 cell lysates. It is anticipated that
the fiber modification will not critically affect the stability of chimeric vectors. Finally,
bone marrow cells can be infected with the retargeted vectors. Two days after infection,
live-cell cytometry is performed for β-gal expression using as substrate fluorescein di-β-
D-galactopyranoside (FDG) (Cantwell, M.J. et al., 1996, Blood, 88:4676-4683; Neering,
S. et al., 1996, Blood, 88:1147-55; Fiering, S. N. et al., 1991, Cytometry, 12:291; Mohler,
W. et al., 1996, PNAS, 93:57) and the infected cells are characterized for morphology and
surface markers. Before and during infection, bone marrow cells can be cultured in
IMDM/FCS supplemented with thrombopoietin (Tpo), which supports the survival of
HSC (Matsunaga, T. et al., 1998, Blood, 92:452-61; Papayannopoulou, T. et al., 1996,
Experimental Hematology, 24:660-69). Alternatively, retargeted vectors can be
generated with the AAV-GFP (green fluorescent protein) cassette, and FACS
analysis can be performed on transduced cells based on GFP and surface marker expression.

H. Competition studies of chimeric fiber protein Ad5/35. 

Competition studies:

Cross-competition studies between Ad5, Ad35, and Ad5GFP/F35 (Fig. 18) for binding and
internalization were performed in order to investigate in more detail the pathways which
are used by the chimeric vector to infect target cells. Wild-type Ad35 and the chimeric
vector Ad5GFP/F35 could recognize the same primary receptor, as they competed with
each other for attachment to K562 cells (Fig. 19A, upper panel). This primary
receptor is different from that used by Ad5, since neither Ad5 viral particles nor anti-
CAR monoclonal antibodies (Fig. 19B, upper panel) were able to abrogate Ad35 or
Ad5GFP/F35 binding. In competition studies for internalization, Ad35 and Ad5GFP/F35
competed with each other with equal efficiency. Ad5 and anti-αv-integrin monoclonal
antibodies (L230) (Figs. 19C, D; lower panel) did not inhibit internalization of Ad35 or
the chimeric virus. To consolidate these data, K562 cells were infected with Ad5GFP and
Ad5GFP/F35 after prior incubation of cells with anti-CAR or anti-αv-integrin
monoclonal antibodies, followed by analysis of GFP-expressing cells. The transduction
data mirror the results obtained in adsorption/internalization studies. In summary, this
demonstrated that Ad35 and Ad5GFP/F35 use a CAR- and αv-integrin-independent
pathway for infection of K562 cells; the structural elements which account for these
specific properties are located within the Ad35 fiber and can be transplanted into Ad5 by
fiber substitution.

Ad3 can efficiently interact with K562 cells (Fig. 12). Although Ad3 and Ad35 belong to
the same subgroup (B), the homology between the amino acid sequences of their fibers is
only about 60%. Therefore, we decided to test whether Ad3 could compete with Ad35
and Ad5GFP/F35 for attachment and internalization (Fig. 20). These studies
demonstrated that Ad35 binding was not inhibited by Ad3, indicating the use of different
receptors. Interestingly, Ad3 slightly inhibited attachment of Ad5GFP/F35 (Fig. 20A, left
panel). In addition to binding to the receptor common to the Ad35 and Ad5GFP/F35
fiber, the chimeric capsid (e.g. the Ad5 penton RGD motifs) may also interact with a
second cellular receptor that overlaps with elements involved in Ad3 binding. In cross-
competition for internalization, pre-incubation of cells at 37°C with Ad35 and with
chimeric virus significantly decreased internalization of [3H]-labeled Ad3 (Fig. 20D, right
panel). In the reverse experiment, Ad3 as competitor decreased the level of
internalization by 30% for both Ad35 and the chimeric virus (Fig. 20B, right panel). As
expected, Ad5 and Ad3 did not compete for adsorption or internalization. As shown
before (Fig. 19B), anti-CAR and anti-αv-integrin antibodies did not block Ad3 interaction
with K562 cells. In summary, we concluded that Ad35 and Ad5GFP/F35 bind to
receptor/s different from that of Ad3, although they can use common structural elements
for internalization, which are different from αv-integrins.

Infection studies with chimeric virus:

It is established that Ad5GFP/F35 infected K562 cells by a CAR- and αv-integrin-independent
pathway. It is possible that this property allows for efficient transduction of non-cycling
CD34+ cells, which scarcely express CAR and αv-integrins. To test this, the transduction
properties of Ad5GFP and Ad5GFP/F35 vectors were analyzed on CD34+ cells, K562,
and HeLa cells. Fig. 21 shows the percentage of transduced, GFP expressing cells
depending on the MOI used for infection. Nearly 100% of HeLa cells were transduced
with Ad5GFP and Ad5GFP/F35 at MOIs of ≥25. More than 95% of the K562 cells
were transduced with Ad5GFP/F35 at MOIs of ≥100, whereas the transduction rate
was significantly lower with Ad5, where it increased with the MOI, reaching a plateau at
~70% GFP-positive cells after infection with an MOI of 400. Transduction of CD34+
cells was about three fold more efficient with Ad5GFP/F35 than with Ad5GFP at all
MOIs analyzed. Interestingly, at higher MOIs, the transduction rate did not rise
proportionally with the viral dose and soon reached a plateau, indicating that in both cases
only specific subset/s of CD34+ cells were permissive to infection. In order to
characterize these specific, permissive subset/s in more detail, additional transduction
studies were performed. First, the percentage of GFP expressing cells was determined in
CD34+ fractions that were stained for αv-integrins or CARs (Fig. 22). The low number of
CAR positive CD34+ cells complicated accurate co-labeling studies, and there was no
correlation between CAR expression and the proportion of transduced cells among
CD34+ cells infected with Ad5GFP or Ad5GFP/F35. Interestingly, for Ad5GFP, 65% of
all GFP expressing cells were positive for αv-integrins, whereas less than 22% of GFP
positive cells infected with the chimeric virus stained positive for αv-integrin expression.
While only 17% of the whole CD34+ population expressed GFP after Ad5GFP infection,
the percentage of GFP-expressing cells in the CD34+/αv-integrin positive fraction was
50%. This indicates that Ad5GFP vector-mediated GFP expression was preferentially
localized to αv-integrin positive CD34+ subsets, whereas after infection with the
Ad5GFP/F35 vector, GFP was expressed in a broader spectrum of CD34+ cells with
most of them being αv-integrin-negative.
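
The percentages reported for the Ad5 vector can be cross-checked with Bayes' rule. The
following is a minimal Python sketch, assuming that the three figures quoted above (17%,
65% and 50%) refer to the same gated CD34+ population. The derived quantities (the implied
size of the αv-integrin positive fraction and the GFP rate outside that fraction) are not
stated in the text and are only as reliable as that assumption; all function and variable
names are hypothetical.

```python
# Figures quoted in the text for CD34+ cells infected with the Ad5-based GFP vector.
P_GFP = 0.17                 # fraction of all CD34+ cells expressing GFP
P_INTEGRIN_GIVEN_GFP = 0.65  # fraction of GFP+ cells that are alpha-v integrin positive
P_GFP_GIVEN_INTEGRIN = 0.50  # fraction of alpha-v integrin positive cells expressing GFP


def implied_integrin_fraction(p_gfp, p_int_given_gfp, p_gfp_given_int):
    """Bayes' rule: P(int+) = P(GFP+) * P(int+ | GFP+) / P(GFP+ | int+)."""
    return p_gfp * p_int_given_gfp / p_gfp_given_int


def gfp_rate_in_negative_fraction(p_gfp, p_int_given_gfp, p_int):
    """P(GFP+ | int-) = P(GFP+) * P(int- | GFP+) / P(int-)."""
    return p_gfp * (1.0 - p_int_given_gfp) / (1.0 - p_int)


p_int = implied_integrin_fraction(P_GFP, P_INTEGRIN_GIVEN_GFP, P_GFP_GIVEN_INTEGRIN)
p_gfp_neg = gfp_rate_in_negative_fraction(P_GFP, P_INTEGRIN_GIVEN_GFP, p_int)
enrichment = P_GFP_GIVEN_INTEGRIN / p_gfp_neg

print(f"implied alpha-v positive fraction of CD34+ cells: {p_int:.1%}")
print(f"GFP rate in the alpha-v negative fraction:        {p_gfp_neg:.1%}")
print(f"enrichment of GFP in the alpha-v positive subset:  {enrichment:.1f}-fold")
```

Under these assumptions the numbers are mutually consistent only if a minority of the
CD34+ cells carry αv-integrins, which agrees with the statement that these cells scarcely
express them.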

Next, transduced cells were simultaneously analyzed for GFP as well as for CD34 and
CD117 markers. As mentioned before, only about 90% of all cells used in our analysis
were positive for CD34 at the time of infection, hence the multiparameter analysis for
CD34 and GFP. A population of CD34+ cells is extraordinarily heterogeneous in
morphology and stem cell capacity. The subpopulation of CD34+ and CD117+ cells
resembles very primitive hematopoietic cells (Ikuta, K., Weissman, I.L. 1992 Proc. Natl.
Acad. Sci. USA. 89:1502-1506; Simmons, P. J., et al. 1994. Expl. Hematology. 22:157-
165). Fig. 23 summarizes the analyses of GFP expression in correlation with these
specific stem cell markers. While 54% of cells infected with chimeric vector were
positive for GFP and CD34, only 25% of cells infected with Ad5GFP expressed the
transgene and the CD34 marker (Fig. 23A, lower panel). More importantly, based on GFP
expression, the chimeric virus transduced 80% of c-kit positive cells, whereas the Ad5-
based vector transduced only 36% (Fig. 23A, middle panel). In an additional experiment,
CD34+ cells were sorted for CD117 expression prior to infection with Ad5GFP or
Ad5GFP/F35 and, 24 hours post-infection, GFP expression was analyzed in this specific
fraction (Fig. 23B). This analysis revealed that the chimeric vectors transduced 4 fold
more CD34+/CD117+ cells than the Ad5GFP vector.

In conclusion, these results demonstrated that the chimeric Ad5GFP/F35 vector was
clearly superior to the Ad5GFP vector in targeting and transduction of CD34+ cells.
Furthermore, the data suggest that the spectrum of CD34+ cell subsets permissive for Ad
infection was significantly different for the chimeric vector than for the Ad5 vector.

Analysis of viral genomes within CD34+ cells infected with the Ad5 and chimeric
vectors:

So far, the transduction rate of CD34+ cells was measured based on GFP expression after
infection with Ad5GFP and Ad5GFP/F35. Considering the extraordinary heterogeneity of
CD34+ cells in morphological and functional parameters, GFP may not be expressed in
all cell types that were efficiently infected. Reasons for this include that the CMV
promoter may not be active in all cell types or that the regulation of transgene expression
could differ between subsets on a post-transcriptional or post-translational level. To test
this, we quantified the number of intracellular (transduced) viral genomes within GFP
positive and GFP negative fractions of CD34+ cells infected with Ad5GFP and
Ad5GFP/F35. To do this, twenty-four hours after infection, CD34+ cells were sorted for
GFP positive and GFP negative fractions, which were subsequently used to isolate
genomic DNA together with transduced viral DNA. The number of viral genomes was
determined by quantitative Southern blot as described for Fig. 15. Per GFP-positive
CD34+ cell, about 270 copies of the Ad5GFP/F35 viral genome were detected.
Interestingly, a remarkable 200 copies of the Ad5GFP/F35 viral genome were found per
GFP-negative CD34+ cell (Fig. 24A and 25). This demonstrated that not all infected cells
expressed GFP and implies that the actual transduction rate was higher than 54% (GFP-
positive cells). We concluded that the CMV promoter was not active in all transduced
CD34+ subsets. No Ad5GFP vector specific signal was detected within infected CD34+
(GFP positive or negative) fractions by Southern blot, which had a detection limit of 14
viral genomes per cell. From this, we can conclude that the vector DNA concentration per
transduced cell was at least about 20 times higher for Ad5GFP/F35 than for Ad5GFP
(about 270 copies versus fewer than 14).

Ad5GFP DNA was only detectable in DNA samples from infected CD34+ cells by
Southern blot after prior PCR amplification with vector specific primers (Fig. 24B and
25). This indicates that the replication deficient Ad5 vector is present but at a very low
copy number, which may be limited by intracellular genome stability. Using the PCR-
Southern detection method, Ad5 vector DNA was also detected in GFP negative cells,
supporting that the CMV promoter may not have been the optimal choice for transduction
studies. It is notable that studies by others on viral genome analyses after infection of
CD34+ cells with Ad5 vectors were performed only after prior PCR amplification
(Mitani, K., et al. 1994. Human Gene Therapy. 5:941-948; Neering, S. J., et al. 1996.
Blood. 88:1147-1155).
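
The lower bound on the copy-number ratio quoted in this section follows from dividing the
measured Ad5GFP/F35 copy number by the Southern blot detection limit. The short Python
sketch below only reproduces that arithmetic from the figures given in the text (270 and
200 copies per cell, detection limit 14); the function name is hypothetical.

```python
def min_fold_difference(detected_copies_per_cell, detection_limit_per_cell):
    """Lower bound on the ratio between a measured copy number and an undetected one.

    If a vector gives no signal, its copy number is below the detection limit,
    so the true ratio is at least detected / limit.
    """
    if detection_limit_per_cell <= 0:
        raise ValueError("detection limit must be positive")
    return detected_copies_per_cell / detection_limit_per_cell


DETECTION_LIMIT = 14    # viral genomes per cell (Southern blot)
GFP_POSITIVE = 270      # chimeric vector copies per GFP-positive CD34+ cell
GFP_NEGATIVE = 200      # chimeric vector copies per GFP-negative CD34+ cell

fold_positive = min_fold_difference(GFP_POSITIVE, DETECTION_LIMIT)
fold_negative = min_fold_difference(GFP_NEGATIVE, DETECTION_LIMIT)

print(f"GFP-positive fraction: more than {fold_positive:.1f}-fold")
print(f"GFP-negative fraction: more than {fold_negative:.1f}-fold")
```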

DISCUSSION 

The chimeric Ad5GFP/F35 vector has binding and internalization properties similar to
Ad35. Therefore, the fiber substitution was sufficient to swap cell tropism from Ad5 to
Ad35. The Ad5GFP/F35 capsid chimera contained the short-shafted Ad35 fiber
incorporated into an Ad5 capsid, instead of the naturally occurring long-shafted Ad5
fiber. During Ad5 infection, interaction between the penton base and integrins is
required to induce viral internalization. For this interaction, the length of the fiber shaft and
the precise spatial arrangement of knob and RGD motifs are critical for the virus entry
strategy. The natural spatial arrangement is disturbed when short-shafted heterologous
fibers are inserted into the Ad5 capsid. Interestingly, the Ad5/35 capsid chimera allows
for efficient infection, suggesting that the protruding RGD motifs in the Ad5 penton
base do not affect the interaction with the primary Ad35 receptor. So far, most of the
chimeric viruses were generated by substituting only the Ad5 knob while maintaining the
long Ad5 fiber shaft (Chillon, M., et al. 1999. J. Virology. 73:2537-2540; Krasnykh, V.
N., et al. 1996. J. Virology. 70:6839-6846; Stevenson, S. C., et al. 1995. J. Virology.
69:2850-2857; Stevenson, S. C., et al. 1997. J. Virology. 71:4782-4790). The exception
was an Ad5/7 chimeric virus (Gall, J., et al. 1996. J. Virology. 70:2116-2123), where the
whole Ad5 fiber was substituted by the short-shafted Ad7 fiber. However, similar to the
parental Ad5, the Ad5/7 chimera still required αv-integrins for infection.

This Ad5GFP/F35 chimera is the first demonstration that, despite the presence of RGD
motifs within the Ad5 penton, the chimeric virus uses cell entry pathways determined
primarily by the receptor specificity of the short-shafted heterologous fiber. This does not
exclude that interaction with a secondary receptor may increase binding affinity. The
latter is supported by the observation that Ad35 and Ad5GFP/F35 slightly differed in
their ability to compete with Ad5 or Ad3 for binding. It is possible that Ad5/35
attachment involves, in addition to the high affinity fiber binding, interaction between
Ad5 capsid proteins (e.g. RGD motifs) and secondary receptor/s that overlap with those
used by Ad3 and Ad5.

These data indicate that infection with Ad5-based vectors is restricted to a specific subset
of CD34+ cells. The percentage of GFP expressing cells after Ad5GFP infection of
CD34+ cells reached a plateau at MOIs higher than 100, indicating that only a limited
fraction of CD34+ cells was permissive to Ad5. Also, strong replication of wild type Ad5
in infected CD34+ cells may be the result of preferential transduction of a specific
subpopulation of CD34+ cells, resulting in expression of early viral genes at a level sufficient
to initiate viral replication. The presence of a specific subpopulation of CD34+ cells
permissive to Ad5-vector infection was suggested by others (Byk, T., et al. 1998. Human
Gene Therapy. 9:2493-2502; Neering, S. J., et al. 1996. Blood. 88:1147-1155). In the
present report, we further characterized this subpopulation and demonstrated that Ad5-
based vectors preferentially infected αv-integrin positive CD34+ cells. Integrins
(including αv) are thought to be important for homing and trafficking of transplanted
hematopoietic cells; however, little is known about the correlation between αv-integrin
expression and the differentiation status of hematopoietic cells (Papayannopoulou, T.,
Craddock, C. 1997. Acta Haematol. 97:97-104; Roy, V., Verfaillie, C.M. 1999. Exp.
Hematol. 27:302-312). There was no clear correlation between CAR and GFP expression,
suggesting that Ad5GFP may be able to use another membrane protein as a primary
receptor. Alternatively, Ad5GFP transduction observed at an MOI of 200-400 could be
the result of direct interaction between virus and αv-integrins triggering internalization,
which may be the preferred pathway in the absence of CAR (Legrand, V., et al. 1999. J.
Virology. 73:907-919). Importantly, infection with the chimeric Ad5GFP/F35 vector was
not restricted to the αv-positive CD34+ subpopulation.

Among CD34+ cells, the subpopulation of CD34+ and CD117+ cells resembles very
primitive hematopoietic cells (Ikuta, K., Weissman, I.L. 1992 Proc. Natl. Acad. Sci. USA.
89:1502-1506; Simmons, P. J., et al. 1994. Expl. Hematology. 22:157-165). The receptor
for stem cell factor, CD117 (c-kit), belongs to a tyrosine kinase family. It was previously
shown that c-kit+, CD34+ cord blood cells contain a high fraction (16%) of
hematopoietic progenitors (Neu, S., et al. 1996. Leukemia Research. 20:960-971). Early
in ontogeny, CD34+/CD117+ cells have long-term repopulating activity (Sanchez, M. J., et
al. 1996. Immunity. 5:513-525). An average of 50-60% of CD34+ cells are reported to be
CD117 positive (Ikuta, K., Weissman, I.L. 1992 Proc. Natl. Acad. Sci. USA. 89:1502-
1506; Neu, S., et al. 1996. Leukemia Research. 20:960-971; Simmons, P. J., et al. 1994.
Expl. Hematology. 22:157-165). In our studies, the chimeric vector expressed GFP in
54% of CD34+ cells and 80% of CD34+/c-kit+ cells. The actual viral transduction rate
could be even higher because transduced Ad5GFP/F35 vector DNA was also found in
GFP-negative fractions of infected cells. This indicates that the CMV promoter used to
drive GFP expression in our vectors was not active in all transduced cells. We selected
the CMV promoter for transgene expression based on published data demonstrating that
PGK and CMV promoters allowed for efficient transgene expression in CD34+ cells
whereas the HTLV-I and RSV promoters were almost inactive (Byk, T., et al. 1998.
Human Gene Therapy. 9:2493-2502; Case, S. S., et al. 1999. Proc. Natl. Acad. Sci. USA.
96:2988-2993). On the other hand, studies by Watanabe et al. (Watanabe, T., et al. 1996.
Blood. 87:5032-5039) suggest that the CMV promoter is not active or rapidly silenced in
certain CD34+ subsets. Our data underscore this observation. Considering retroviral
transduction studies, the retroviral MLV promoter may have been a better candidate for
transduction studies in hematopoietic cells (Bregni, M., et al. 1998. Gene Therapy. 5:465-
472).

After having demonstrated that the Ad5GFP/F35 vector efficiently transduced cells
carrying stem cell specific markers, the next logical step would be to perform colony
assays with pre-sorted GFP positive/negative cells. However, this assay is complicated by
the fact that infection with first generation Ad vectors is cytotoxic and affects the
formation and growth of progenitor colonies in MC-cultures (Mitani, K., et al. 1994.
Human Gene Therapy. 5:941-948; Watanabe, T., et al. 1996. Blood. 87:5032-5039). This
side effect is caused by the expression of Ad proteins within transduced cells (Lieber, A.,
C.-Y. et al. 1996. Journal of Virology. 70:8944-8960; Schiedner, G., et al. 1998. Nature
Genetics. 18:180-183; Yang, Y., et al. 1994. Proc. Natl. Acad. Sci. USA. 91:4407-4411).
Some of these proteins (e.g. E4-orf4, pTP, or E3-11.6k) have pro-apoptotic activity
(Langer, S. J., Schaak, J. 1996. Virology. 221:172-179; Lieber, A., et al. 1998. J.
Virology. 72:9267-9277; Shtrichman, R., Kleinberger, T. 1998. J. Virology. 72:2975-
2983; Tollefson, A. E., A et al. 1996 J. Virology. 70:2296-2306). Clearly, this would
affect the outcome of transduction studies with Ad5GFP/F35, which allows for the
efficient transfer of viral genomes into CD34+ cells, implying significant expression of
viral proteins. Moreover, recently published data indicate that short-term colony assays
mostly measure mature progenitors and do not represent a rigorous test for transduction
of potential stem cells.

A definitive demonstration that Ad5GFP/F35 based vectors can transduce HSC requires
colony assays or, preferably, repopulation assays in SCID-NOD mice. We can perform
these studies with gutless vectors (Steinwaerder, D. S., et al. 1999. J Virol 73:9303-13)
and integrating ΔAd.AAV vectors devoid of all viral genes (Lieber, A., et al. 1999. J
Virol 73:9314-24) generated based on Ad5GFP/F35 chimeric capsids. Alternatively,
gutless, retargeted vectors could be used to transiently express a retroviral receptor on
CD34+ cells to increase their susceptibility to infection with retroviral vectors, based on
an approach that we have published earlier (Lieber, A., et al. 1995. Human Gene
Therapy. 6:5-11).

Our finding that Ad5GFP/F35 can efficiently transduce hematopoietic cells with potential
stem cell capacity represents an important step towards stable gene transfer into HSCs
and gene therapy of blood disorders. Furthermore, the virological aspects of this
invention contribute to a better understanding of adenovirus cell interactions.

I: Retargeting of Ad5 based vectors with modified fibers carrying specific ligand
peptides for HSC and other cell types

Another alternative to make Ad5-capsid-based vectors suitable for HSC gene therapy is
to incorporate the coding sequence for HSC specific peptides into the HI loop region of
the Ad5 fiber gene. The modification of the HI loop was successfully exercised by
Krasnykh et al. with a 7 amino-acid long FLAG peptide (DYDDDDK). Using phage
display peptide libraries (Pasqualini, R. et al., 1996, Nature, 380:364-66), Renata
Pasqualini (La Jolla Cancer Research Center) reported recently, at the First Meeting of
the American Society for Gene Therapy, the identification of small peptide ligands
specific for bone marrow cells. The corresponding sequences encoding these peptides
can be added to modify the HI loop sequence employing site-directed mutagenesis.
Optimally, the ligands should allow for the efficient internalization of adenoviral particles
based on a CAR- and integrin-independent pathway. Modified adenoviral vectors
containing the AAVBG cassette can be produced and tested for HSC tropism as
described above.

Adenovirus peptide display: 



In order to retarget adenoviruses to any cell type of interest, a strategy is provided which 
involves creating a library of adenoviruses displaying random peptides in their fiber 
knobs as ligands and screening this library for adenovirus variants with tropism to a 
particular cell type in vitro and potentially in vivo. 



The development of the adenovirus peptide display technique is based on the following
ideas.

(i) Although the tertiary structure of the Ad5 fiber knob is known, it remains
unclear which domains are involved in receptor binding. There are data suggesting that
receptor-binding domains partially overlap with hemagglutination domains, which are
well characterized for a number of serotypes. Therefore, three intramolecular loop
regions representing potential receptor binding sites can be substituted by random peptide
libraries. Eight amino acid residues in the center of the FG, or GH loops can be
substituted by octameric random peptides (Figs. 26 and 27). These substitutions will
replace CAR tropism and allow for infection of refractory cell types.

(ii) To synthesize the oligonucleotides encoding the peptide library, a novel technique to
assemble pre-synthesized trinucleotides representing the codons for all 20 amino acids is
employed. This avoids termination codons and assures optimal codon usage and translation
in human cells. Two library designs are possible: a completely randomized library, with
all 20 amino acids being incorporated with the same probability, and a partially
randomized library, with only three (on average) random amino acid substitutions per
octamer at random positions, to maintain certain critical features of the tertiary
knob structure while introducing variability. The last model is based on the distribution
of amino acids present in the hypervariable CDR 1 or 2 region of immunoglobulins.

(iii) To maintain a representative library size of about 10^10 different octamers per modified
loop, a new cloning strategy is employed to allow for insertion of the library into the
wild-type Ad5 genome without introducing additional amino acids at the substitution site
and without transformation into bacteria. This strategy is based on a "seamless" cloning
technique available from Stratagene.

(iv) In order to produce the library of viruses, viral genomic DNA containing the
modified fiber sequences is transfected into 293 cells without reduction of the library
size. This critical step is done by conjugating the viral library DNA to carrier Ad5-based
adenovirus via poly-lysine to assure 100% transfection efficiency. This technique allows
for coupling of ~1 µg of plasmid DNA (or ~1x10^10 adenoviral genomes) to 10^10 viral
particles, which can be used to infect 293/cre cells at an MOI of 10-100. Importantly, the
carrier adenoviral genome has the packaging signal flanked by lox sites, preventing the
packaging of carrier viral DNA after infection of 293 cells that express Cre recombinase
(293/cre). This helper virus system is routinely used to produce so-called gutless
adenoviruses. Therefore, the virus progeny represents library genomes packaged into
capsids containing preferentially Ad5 fibers. This is important for the next infection
step into 293 cells at an MOI of 1 to assure a homogeneous fiber population on the capsid
where the fibers are encoded by the packaged genome.
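
The orders of magnitude in steps (iii) and (iv) can be checked with a back-of-the-envelope
calculation. The Python sketch below is illustrative only: the genome length (about 36 kb)
and the average mass of a base pair (about 650 g/mol) are assumed textbook values that do
not appear in the text, uniform sampling with replacement is assumed for the coverage
estimate, and all names are hypothetical.

```python
import math

AVOGADRO = 6.022e23          # molecules per mole
BP_MASS_G_PER_MOL = 650.0    # ASSUMPTION: average mass of one double-stranded base pair
GENOME_BP = 36_000           # ASSUMPTION: approximate length of an Ad5-sized genome


def genome_copies(mass_g, genome_bp=GENOME_BP):
    """Number of double-stranded DNA molecules of genome_bp contained in mass_g grams."""
    moles = mass_g / (genome_bp * BP_MASS_G_PER_MOL)
    return moles * AVOGADRO


def sequence_space(alphabet_size=20, length=8):
    """Number of distinct peptides of the given length over the given alphabet."""
    return alphabet_size ** length


def expected_distinct(library_size, space):
    """Expected number of distinct sequences when drawing library_size clones
    uniformly and independently from a space of the given size."""
    return space * (1.0 - math.exp(-library_size / space))


copies_per_microgram = genome_copies(1e-6)
octamers = sequence_space(20, 8)
distinct = expected_distinct(1e10, octamers)

print(f"genomes in 1 microgram of DNA:        {copies_per_microgram:.2e}")
print(f"possible octamers (20^8):            {octamers:.2e}")
print(f"distinct octamers in 1e10 clones:    {distinct:.2e}")
```

With these assumptions, 1 µg of genome-length DNA corresponds to roughly 10^10 molecules,
and a library of 10^10 clones is of the same order as the 20^8 possible octamers, which is
why no step may reduce the library size.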

J. Production Of Adenovirus Vectors With Increased Tropism To Hepatocytes

An example of a G-H loop substitution to target Ad5 to hepatocytes was successful.
Preliminary tests demonstrated that two evolutionarily conserved regions within the
malaria circumsporozoite surface protein (CS), termed RI and RII+, mediate specific
interaction with hepatocytes but not with other organs (including spleen, lung, heart and
brain), nor with Kupffer cells, liver endothelial cells or with other regions of the
hepatocyte membrane (Cerami, C. et al., 1992, Cell, 70:1021-33; Shakibaei, M. and U.
Frevert, 1996, J. Exp. Med., 184:1699-711). These regions are conserved among
different species including Plasmodium berghei, P. cynomolgi, and P. falciparum that
infect mouse, monkey and human hepatocytes, respectively (Cerami, C. et al., 1992, Cell,
70:1021-33; Chatterjee, S. et al., 1995, Infect. Immun., 63:4375-81). Peptides derived
from RI (KLKQPG) or RII (EWSPCSVTCGNGIQVRIK) blocked CS binding to
hepatocytes and infection by sporozoites in vivo (Cerami, C. et al., 1992, Cell, 70:1021-
33; Chatterjee, S. et al., 1995, Infect. Immun., 63:4375-81). RI and RII+ peptides were
separately inserted into the Ad5 fiber knob (H-I and G-H loop) containing mutations that
abolish binding to CAR and alpha-v integrins (Kirby, I. et al., 2000, J. Virol., 74:2804-
13; Wickham, T. J. et al., 1995, Gene Ther., 2:750-6). Based on preliminary data, a
short-shafted fiber was used so that the virus entry strategy predominantly depends on the
interaction with the primary (hepatocyte-specific) receptor. The hepatocyte-specific
ligands are flanked by short glycine stretches to provide flexibility and embedded into a
loop formed by two cysteines. This is one of the classical strategies to incorporate ligands
into a protein scaffold (Doi, N. and H. Yanagawa, 1998, Cell Mol. Life Sci., 54:394-404;
Koivunen, E. et al., 1995, Biotechnology (NY), 13:265-70) and to guarantee their
presentation at the protein surface. The biodistribution of the best variants is tested in
vivo in C57BL/6 mice based on Southern blots or PCR for vector DNA in different
organs. This mouse strain is known to be susceptible to infection with P. berghei
(Chatterjee, S. et al., 1995, Infect. Immun., 63:4375-81).

K. Production Of Adenovirus Vectors With Increased Tropism To Tumor Cells

A similar strategy is to insert two peptides obtained after selection for tumor tropism by
displaying random peptides on filamentous phages. The first double cyclic peptide
(RGD-4) proved to bind specifically to integrins present on tumor vasculature (Ellerby, H.
M. et al., 1999, Nat. Med., 5:1032-8). The second peptide targets specific matrix
metalloproteinases associated with metastatic tumor cells as shown for the breast cancer
cell line MDA-MB-435 (Koivunen, E. et al., 1999, Nat. Biotechnol., 17:768-74).
Tropism-modified vectors are tested in animal models with hepatic metastases derived
from MDA-MB-435 cells (Fig. 28).

L. Development Of A Peptide Display Technique Based On Adenoviruses 

A synthetic peptide library is described that allows adenovirus vectors to express random
peptides in the G-H loop of the fiber knob domain. The technique of a phage display
library is optimized to generate a library of adenoviruses displaying random peptides in
their fiber knob. This library of adenovirus variants is then screened for tropism to a
particular cell type in vitro and potentially in vivo. The oligonucleotides encoding the
peptide library employ a novel technique to assemble pre-synthesized trinucleotides
representing the codons for all 20 amino acids. This will avoid termination codons and
assure optimal codon usage and translation in human cells. To maintain a representative
library size, a new "seamless" cloning strategy is used that allows for insertion of the
library into the wild-type Ad5 genome without introducing additional amino acids at the
substitution site and without transformation into bacteria. Transfection into 293 cells is
done by conjugating the viral library DNA to carrier Ad5-based adenovirus via polylysine to
assure a 100% transfection efficiency. Importantly, the carrier adenoviral genome has its
packaging signal flanked by lox sites, preventing the packaging of carrier viral DNA after
infection of 293 cells that express Cre recombinase (293/cre). The library is produced
with E1-positive viruses depleted for CAR and integrin tropism. Only variants that have
successfully infected the cell type of interest will replicate, resulting in de novo produced
virus. The sequence of the peptide ligand that conferred the particular tropism will then
be analyzed in de novo produced virus.
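
The idea of assembling the library from one pre-synthesized trinucleotide per amino acid
can be sketched in a few lines of Python. This is an illustration under stated assumptions,
not the synthesis protocol of the text: the particular codon chosen for each amino acid is
an assumption (any sense codon would serve), the template octamer is a placeholder rather
than a real fiber loop sequence, the partial randomization simply replaces each position
with probability 3/8, and all names are hypothetical.

```python
import random

# ASSUMPTION: one illustrative sense codon per amino acid; the actual
# trinucleotide set used for synthesis is not given in the text.
CODON_FOR = {
    "A": "GCC", "R": "CGG", "N": "AAC", "D": "GAC", "C": "TGC",
    "Q": "CAG", "E": "GAG", "G": "GGC", "H": "CAC", "I": "ATC",
    "L": "CTG", "K": "AAG", "M": "ATG", "F": "TTC", "P": "CCC",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAC", "V": "GTG",
}
AMINO_ACIDS = sorted(CODON_FOR)
STOP_CODONS = {"TAA", "TAG", "TGA"}


def random_octamer(rng, length=8):
    """Fully randomized peptide: every amino acid equally likely at every position."""
    return "".join(rng.choice(AMINO_ACIDS) for _ in range(length))


def partially_randomized(template, rng, mean_substitutions=3.0):
    """Replace each position independently so that, on average,
    mean_substitutions positions per peptide are redrawn at random.
    A redrawn residue may coincide with the original one."""
    p = mean_substitutions / len(template)
    return "".join(rng.choice(AMINO_ACIDS) if rng.random() < p else aa
                   for aa in template)


def encode(peptide):
    """Assemble the coding sequence from one trinucleotide block per residue."""
    return "".join(CODON_FOR[aa] for aa in peptide)


def has_in_frame_stop(dna):
    """True if any in-frame codon of the sequence is a termination codon."""
    return any(dna[i:i + 3] in STOP_CODONS for i in range(0, len(dna), 3))


rng = random.Random(0)            # fixed seed so the sketch is reproducible
TEMPLATE = "GSGSGSGS"             # placeholder octamer, NOT a real loop sequence
full = random_octamer(rng)
partial = partially_randomized(TEMPLATE, rng)
for peptide in (full, partial):
    print(peptide, encode(peptide), has_in_frame_stop(encode(peptide)))
```

Because every block is drawn from a fixed set of sense codons, no termination codon can
arise in frame, which is the property the trinucleotide approach is meant to guarantee;
a library built from randomized single nucleotides would not have it.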

EXAMPLE III 

COMBINATION NOVEL ADENOVIRAL VECTOR AND MODIFIED FIBER
PROTEIN

This example describes the following studies which combine the technology of the
integrating adenovirus vector that is devoid of all adenoviral genes with the modified
fiber protein that retargets the vector to quiescent HSC.

A. Transduction studies with re-targeted vectors in HSC: 

In order to transduce quiescent HSC and integrate into chromosomal DNA, retargeted
ΔAd.AAVfx vectors are tested for reporter gene expression and vector integration,
while the clonogenic capacity of the infected cells is analyzed simultaneously. The
modified ΔAd.AAV hybrid vectors contain genomes devoid of all adenoviral genes (a
"gutless" adenovirus vector) packaged into Ad5 capsids with modified fibers. Rep may be
incorporated into these ΔAd.AAVfx vectors to allow for site-specific integration into
AAVS1.

Transduction studies: 

Purified human CD34+ cells in IMDM/FCS+IL-3 and SCF are infected with different
doses of ΔAd.AAVfx-BG (1-10^7 genomes per cell). CD34+ cells infected with
ΔAd.AAVfx-βGal are cultured for 2 days in suspension, and β-Gal+ cells are sorted by FACS
using FDG as substrate. This determines the infection efficiency. β-Gal expressing cells
are then submitted to clonogenic assays in semi-solid cultures (in two dishes per MOI) in
the presence of multiple cytokines (IL-3, SCF, Epo, G-CSF, GM-CSF, IL-7, Tpo). A
first set of semi-solid cultures can be evaluated after 7 days; another set can be analyzed
after 14 days. Colonies that have formed in semi-solid culture can be characterized by
light microscopy and subsequently stained with X-Gal. Most of the vector
genomes should remain episomal and can be lost with successive cell divisions. Thus,
while most cells can be X-Gal positive at day 2 or day 7 after infection, most of the larger
colonies (analyzed at day 14 p.i.) may not stain homogeneously for β-Gal. A
representative number of X-Gal positive and X-Gal negative colonies can be picked and
analyzed for episomal and integrated vector DNA. The outcome depends on the MOI
used for infection and the integration status of the vector. These studies determine
whether hybrid vectors can infect primitive progenitors.
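
The expected loss of episomal genomes in growing colonies can be illustrated with a simple
dilution model. The Python sketch below assumes that episomal genomes do not replicate, are
not degraded, and are split evenly between daughter cells; these are simplifying assumptions
made for illustration only, the starting copy number is an arbitrary example value, and the
function names are hypothetical.

```python
def mean_copies_after(initial_copies, divisions):
    """Mean episomal copies per cell after a number of cell divisions,
    assuming non-replicating genomes are halved at every division."""
    if divisions < 0:
        raise ValueError("divisions must be non-negative")
    return initial_copies / (2 ** divisions)


def divisions_until_below(initial_copies, threshold=1.0):
    """Smallest number of divisions after which the mean copy number
    per cell has dropped below the threshold."""
    if threshold <= 0:
        raise ValueError("threshold must be positive")
    copies = float(initial_copies)
    divisions = 0
    while copies >= threshold:
        copies /= 2.0
        divisions += 1
    return divisions


EXAMPLE_COPIES = 256   # arbitrary example value, not a measurement from the text
for n in (0, 4, 8, 12):
    print(n, mean_copies_after(EXAMPLE_COPIES, n))
print("divisions until < 1 copy per cell:", divisions_until_below(EXAMPLE_COPIES))
```

Under this model a colony that has gone through ten or more divisions contains many cells
without any episomal genome, so homogeneous staining of a large colony would point to an
integrated, stably inherited vector copy.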

Detailed characterization of hybrid vector integration:

CD34+ cells can be infected with ΔAd.AAVfx-SNori (MOI 1-10^7) and subjected to G418
selection in methyl cellulose (MC) cultures in the presence of growth factors (IL-3 and
SCF). The resulting colonies are a mixture of mainly myeloid cells. The number and
morphology of G418 resistant colonies can be determined after 2 weeks of selection.
This strategy may be disadvantageous in that the appropriate stem cell may not divide
and form G418 resistant colonies under the specific culture conditions used. Moreover, it
may be difficult to perform G418 selection on a population of heterogeneous cells, which
vary in their sensitivity to G418. Therefore, another set of ΔAd.AAVfx-SNori infected
CD34+ cells can be cultured in methyl cellulose (+IL-3, SCF) without G418 selection.
After 2-3 weeks, single colonies can be picked from both (w/ and w/o G418) MC
cultures, morphologically characterized, and analyzed for integrated vector using the
modified protocol developed for integration studies in a small number of cells (see Figure
8). This strategy allows the assessment of whether hybrid vectors integrate into the
genome of CD34+ cells cultured in the presence of growth factors. This study gives us
an idea about potential position effects affecting neo or βgal expression from integrated
vector copies and about the structure of the integrated vector and the flanking
chromosomal regions.

An alternative method to confirm vector integration:

Fluorescence in situ hybridization (FISH) analysis can be performed in individual cells
from MC colonies. CD34+ cells are cultured in MC in the presence of growth factors to
induce cell division and subsequently treated with colchicine. Metaphase chromosome
spreads are analyzed with a biotin-ATP labeled probe specific for the βGal or SNori gene
and a digoxigenin-UTP labeled probe for the human X-chromosome as an internal control
(provided by Christine Disteche, University of Washington). Specific hybridization can
be visualized with corresponding anti-biotin or anti-DIG antibodies labeled with different
fluorochromes (e.g. FITC and Texas Red). Hybrid vector DNA may integrate as
concatemers, which would facilitate detection by FISH. This technique allows one to
localize the chromosomal integration sites of hybrid vectors.

Test transduction into quiescent hone marrow subpopidations: 

Hybrid vectors described so far can be tested to see whether quiescent CD34+ cells can
be stably transduced. To avoid significant cell proliferation, purified CD34+ cells are
cultured in serum-free IMDM supplemented with thrombopoietin (Tpo). Tpo alone can
support the survival of stem cells without stimulating their active proliferation
(Matsunaga, T. et al., 1998, Blood, 92:452-61; Papayannopoulou, T. et al., 1996,
Experimental Hematology, 24:660-69). To analyze the proliferation status of CD34+ cells
at the time of infection with the hybrid vector ΔAd.AAVfx-BG, BrdU is added to the
culture medium 2 hours before infection. One set of cells is maintained as a
suspension culture in IMDM containing Tpo only for two days. Another set of cells is
grown in IMDM+Tpo supplemented with multiple cytokines. Forty-eight hours after
infection, CD34+ cells can be FACS sorted for beta-Gal expression using FDG. FDG
positive cells can be further analyzed for cellular DNA replication based on BrdU
incorporation and for specific CD34+ subset markers. To do this, cytospins from FDG+
cells can be submitted to immunofluorescence with BrdU-specific antibodies and with
antibodies to specific cell surface markers (e.g. CD38, CD41). Alternatively, consecutive
paraffin sections of the same cell can be analyzed for (a) transgene expression by X-Gal
staining, (b) DNA synthesis based on BrdU incorporation, and (c) specific surface
markers. This allows one to confirm that the culture conditions with Tpo alone prevent
significant genomic DNA replication and subsequent cell proliferation, as well as to
determine whether quiescent CD34+ cells can be infected, based on beta-Gal expression
in cells where BrdU labeling is absent.
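
The readout logic described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not part of the described protocol: the `Cell` record, the `classify` function, and the boolean FDG/BrdU readouts are hypothetical names introduced here, and real FACS or immunofluorescence data would need signal thresholds that the text does not specify.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    """Hypothetical per-cell readout (names are assumptions, not from the text)."""
    fdg_positive: bool   # beta-Gal expression detected with FDG (transduced)
    brdu_positive: bool  # BrdU incorporated (cellular DNA was replicated)


def classify(cell: Cell) -> str:
    """Combine the two readouts into one of four categories."""
    if cell.fdg_positive and not cell.brdu_positive:
        return "transduced, quiescent"
    if cell.fdg_positive:
        return "transduced, cycling"
    if cell.brdu_positive:
        return "untransduced, cycling"
    return "untransduced, quiescent"


def quiescent_transduced_fraction(cells):
    """Fraction of BrdU-negative cells that are FDG-positive.

    Returns None when no BrdU-negative cell is present.
    """
    quiescent = [c for c in cells if not c.brdu_positive]
    if not quiescent:
        return None
    return sum(c.fdg_positive for c in quiescent) / len(quiescent)
```

In this sketch, a non-zero fraction from `quiescent_transduced_fraction` corresponds to the outcome of interest: beta-Gal expression in cells where BrdU labeling is absent.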

Test hybrid vector integration into quiescent CD34+ cells:

Two sets of CD34+ cells are infected. The first set of ΔAd.AAVfx-SNori infected cells is
cultured for 5-7 days in the presence of cytokines; the other set is cultured without
cytokines. To maintain CD34+ cell viability without cytokines during this period, the
cells are cultured in the presence of Tpo or underlaid with a stromal cell line (AFT024)
(Moore, K. et al., 1997, Blood, 89:4337-47), which can maintain HSC viable for 4 to 7
weeks. After this period, both sets are submitted to clonogenic assays (in
the presence of multiple cytokines), either in combination with G418 selection or without
selection. Single colonies are analyzed morphologically and submitted to genomic DNA
analysis (Figure 8) to determine the vector integration status. The ultimate proof for stem
cell transduction is the in vivo survival/expansion assay. To do this, the CD34+ cells
expressing beta-Gal are used for transplantation experiments. If the number of FDG+
cells is not sufficient, total ΔAd.AAVfx-BG infected cells as well as all ΔAd.AAVfx-SNori
infected cells can be used directly without selection. Transplantation can be performed
via tail vein injection into sublethally irradiated SCID NOD mice (Dao, M.A., et al.,
1998, Blood, 4, 1243-1255; Matsunaga, T. et al., 1998, Blood, 92:452-61). At different
time points after transplantation (4 to 8 weeks), mice can be sacrificed to obtain bone
marrow cells, which can then be cultured in suspension until various assays are performed
for X-Gal and cell markers as described earlier. These cells can also be submitted to a
secondary colony assay in MC or to secondary transplantation into SCID NOD mice.
Furthermore, MC colonies derived from these cells can be analyzed for the presence of
integrated vector DNA by the method illustrated in Figure 8. The expression and
integration data together allow conclusions about the repopulation efficiency and about
potential position effects.

B. Optimization of ΔAd.AAVfx vectors for γ-globin expression in hematopoietic
stem cells:

One specific example of the invention is (a) to construct a retargeted hybrid vector with
the γ-globin gene as the transgene under the control of an erythroid cell specific promoter,
(b) to analyze the level and kinetics of γ-globin expression after transduction with hybrid
vectors in in vitro and in vivo assays, (c) if required, to protect gene expression from
position effects using γ-globin LCRs or insulators incorporated into hybrid vectors, and
(d) to study whether γ-globin introns or heterologous introns can increase γ-globin
expression.

Another central issue of the invention is to demonstrate that hybrid vectors can
accommodate larger transgenes than rAAV and retroviruses. The insert size limit of
rAAV and retroviral vectors is 5kb. Transgene cassettes up to 8kb can be inserted into
hybrid vectors as described. The maximal insert size may be about 14kb if hybrid vectors
are produced on the basis of E2a and/or E4 deleted rAd vectors in corresponding packaging
cell lines. The maximal insert size in hybrid vectors is dictated by the packaging limit of
first generation vectors (Ad.AAV) (<36kb), which are necessary intermediates for hybrid
virus production at large scale. It is expected that the stability and titer of Ad.AAV
vectors with an 8kb globin gene cassette are comparable to those of the vectors containing
the 2.5-3.5kb cassettes used in Ad.AAVBG, Ad.AAV1, and Ad.AAVSNori. The following
example experiments address these issues.

Production of ΔAd.AAVfx with large globin expression cassettes:

In order to improve the condition of sickle cell disease, the expression level of the
transferred γ-globin gene must be at least 50% of that of each endogenous β gene. These
levels of transgene expression can only be achieved by using optimal expression
cassettes, including extended LCRs and intron-containing gamma genes (Forrester,
W.C., et al., 1986, Proc. Natl. Acad. Sci. USA 83, 1359-1363; Fraser, P., et al., 1998,
Curr. Opinion in Cell Bio., 10, 361-365; Grosveld, F., et al., 1998, Seminars in
Hematology, 35, 105-111; Martin, D. et al., 1996, Current Opinion in Genetics and
Development, 6:488-95). So far, most γ-globin expression cassettes have been designed
for retroviral and rAAV vectors; thus, they are less than 5kb and have to be devoid of
internal splice sites or polyadenylation signals. With the integrating vectors described
herein, it is possible to go beyond this size limitation. This allows one to improve γ-globin
expression in bone marrow cells in terms of an adequate expression level and long term
persistence. For this purpose, γ-globin constructs developed by Li et al. (Emery, D. W.,
et al., 1999, Hum Gene Ther 10:877-88; Li, Q., et al., 1999, Blood 93:2208-16) or by Ellis
et al. (Ellis, J., et al., 1996, EMBO J., 15, 562-568; Ellis, J., et al., 1997, Nucleic Acids
Res. 25, 1296-1302) are chosen.

(i) The first cassette contains a γ-globin expression unit used in retroviral vectors.
This allows for a direct comparison between the two systems. This construct includes the
beta promoter from -127 to the beta initiation codon, which is connected in frame with
the gamma coding region. This beta promoter is combined with the 300bp HS40 derived
from the human alpha globin locus, which acts as a strong enhancer for globin
expression. The globin gene is the 1.1kb version with intron 1 and partially deleted intron
2. A second cassette is generated containing the HS40, the beta promoter, and the
complete 3.3kb gamma globin gene.



(ii) The second construct contains the 6.5kb beta μLCR, which confers a dominant
chromatin opening activity and an adequate level of gamma globin expression in
transgenic mice. The LCR is linked to the short 1.1kb version of the gamma globin gene
or the complete 3.3kb gamma gene.



(iii) Additional globin expression cassettes can be generated which include
insulators, MARs or SARs, as well as other elements that can improve transgene
expression from integrated vectors or in transgenic animals, like introns derived from the
HPRT or hGH genes (Chung, J.H., et al., 1997, Proc. Natl. Acad. Sci. USA 94, 575-580;
Dunaway, M., et al., 1993, Mol Cell Biol, 17, 182-189; Felsenfeld, G., et al., 1996,
Proc. Natl. Acad. Sci. USA 93, 9384-9388; Klehr, D., et al., 1991, Biochemistry, 30,
1264-1270).
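
The size arithmetic behind cassettes (i) and (ii) can be made explicit with a short Python sketch. It is a hedged illustration only: the element sizes and insert limits are the figures quoted in the text, while the dictionary keys, the function names `cassette_size_bp` and `smallest_backbone`, and the backbone labels are hypothetical names introduced here. Promoter and polyadenylation sequences are not sized in the text and are left out, so each total is a lower bound.

```python
# Element sizes in base pairs, as quoted in the text. Promoter and
# polyadenylation sequences are not sized there and are omitted,
# so every total computed below is a lower bound.
ELEMENT_BP = {
    "HS40": 300,          # enhancer from the human alpha globin locus
    "uLCR": 6500,         # 6.5kb beta LCR
    "gamma_short": 1100,  # 1.1kb gamma globin gene version
    "gamma_full": 3300,   # complete 3.3kb gamma globin gene
}

# Insert size limits in base pairs, as quoted in the text, smallest first.
BACKBONE_LIMITS_BP = [
    ("rAAV or retrovirus", 5000),
    ("hybrid vector as described", 8000),
    ("hybrid vector from E2a/E4-deleted rAd", 14000),  # "about 14kb"
]


def cassette_size_bp(elements):
    """Sum the sizes of the named elements."""
    return sum(ELEMENT_BP[name] for name in elements)


def smallest_backbone(size_bp):
    """Return the first listed backbone whose limit fits, or None."""
    for name, limit in BACKBONE_LIMITS_BP:
        if size_bp <= limit:
            return name
    return None


if __name__ == "__main__":
    for parts in (["HS40", "gamma_short"], ["HS40", "gamma_full"],
                  ["uLCR", "gamma_short"], ["uLCR", "gamma_full"]):
        size = cassette_size_bp(parts)
        print(parts, size, smallest_backbone(size))
```

Under these assumptions, the μLCR with the short gene (at least 7.6kb) exceeds the 5kb limit of rAAV and retroviral vectors but stays within the 8kb hybrid vector limit, whereas the μLCR with the complete gene (at least 9.8kb) would require the E2a and/or E4 deleted backbone.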

Transduction studies with ΔAd.AAVfx-globin vectors:

Transduction studies with globin-hybrid vectors are performed as described earlier
(Steinwaerder, D. S., et al., 1999, J Virol 73:9303-13). Transduced CD34+ cells are
submitted to differentiation in colony assays or analyzed in in vivo expansion assays in
SCID-NOD mice. MC-colonies or bone marrow cells from experimental mice are
analyzed for globin expression. Gamma-globin expression is measured using fluorescent
anti-gamma-globin antibodies. RNase protection studies can be performed to
specifically quantitate gamma globin mRNA in comparison with β-globin RNA. For these
studies, around 10^4-10^5 cells are needed per test.

Position effects:

In the absence of the LCR, globin genes are subjected to strong position effects when
they are transferred into cultured CD34+ cells or erythroleukemic lines (Fraser, P., et al.,
1998, Curr. Opinion in Cell Bio., 10, 361-365; Grosveld, F., et al., 1998, Seminars in
Hematology, 35, 105-111). Another concern is that site-specific integration of
ΔAd.AAV/rep vectors into AAVS1 may silence transgene expression. If silencing
happens, it can be overcome by incorporating LCRs such as the 6.5kb β-globin μLCR
(Ellis, J., et al., 1996, EMBO J., 15, 562-568; Grosveld, F., et al., 1998, Seminars in
Hematology, 35, 105-111) or insulators into ΔAd.AAV based expression units. Insulators
are DNA elements that protect an integrated reporter gene from chromosomal position
effects or that block enhancer-activated transcription from a downstream promoter.
Insulator elements are known for Drosophila melanogaster genes (gypsy, suppressor of
Hairy wing, scs, scs', Fab-7), for the chicken beta-globin gene (HS4), and for the T cell
receptor (BEAD1; 14, 21, 25). Specifically, the Drosophila gypsy or the beta-globin
insulator can be inserted as two copies flanking the globin expression cassette into hybrid
vectors. The position effects can be examined in transduced MC-colonies based on the
analysis of integrated vector DNA (see Fig. 29) and gamma-globin mRNA quantification. 
Analogous studies can be performed on transduced human bone marrow cells obtained 
after transplantation of infected CD34+ cells into SCID-NOD mice. 

Intron effects on gamma-globin expression: 

A number of reports reveal that the deletion of globin introns, particularly the second
intron of the beta and gamma genes, decreases globin mRNA stability and thus the
expression level (Antoniou, M. et al., 1998, Nucleic Acids Res., 26:721-9). RNA viruses
such as onco-retro, lenti- and foamy viruses are problematic as vehicles for
intron-containing transgenes. Because ΔAd.AAV is a DNA virus, it should package globin
introns and LCRs if necessary without the decreased titers and rearrangements observed
with retroviral vectors.
APPENDIX I

HUMAN AND ANIMAL ADENOVIRUSES AVAILABLE FROM
AMERICAN TYPE CULTURE COLLECTION

1: Adenovirus Type 21 ATCC VR-1099 (NIAID V-221-002-014)
2: SA18 (Simian adenovirus 18) ATCC VR-943 Classification
3: SA17 (Simian adenovirus 17) ATCC VR-942 Classification
4: Adenovirus Type 47 ATCC VR-1309 Classification: Adenov
5: Adenovirus Type 44 ATCC VR-1306 Classification: Adenov
6: Avian adenovirus Type 4 ATCC VR-829 Classification: Ad
7: Avian adenovirus Type 5 ATCC VR-830 Classification: Ad
8: Avian adenovirus Type 7 ATCC VR-832 Classification: Ad
9: Avian adenovirus Type 8 ATCC VR-833 Classification: Ad
10: Avian adenovirus Type 9 ATCC VR-834 Classification: Ad
11: Avian adenovirus Type 10 ATCC VR-835 Classification: A
12: Avian adenovirus Type 2 ATCC VR-827 Classification: Ad
13: Adenovirus Type 45 ATCC VR-1307 Classification: Adenov
14: Adenovirus Type 38 ATCC VR-988 Permit: PHS permit requ
15: Adenovirus Type 46 ATCC VR-1308 Classification: Adenov
16: Simian adenovirus ATCC VR-541 Classification: Adenovir
17: SA7 (Simian adenovirus 16) ATCC VR-941 Classification:
18: Frog adenovirus (FAV-1) ATCC VR-896 Classification: Ad
19: Adenovirus Type 48 (candidate) ATCC VR-1406 Classifica
20: Adenovirus Type 42 ATCC VR-1304 Classification: Adenov
21: Adenovirus Type 49 (candidate) ATCC VR-1407 Classifica
22: Adenovirus Type 41 ATCC VR-1305 Classification: Adenov
23: Avian adenovirus Type 6 ATCC VR-831 Permit: USDA perm
24: Avian adenovirus Type 3 (Inclusion body hepatitis virus)
25: Bovine adenovirus Type 3 ATCC VR-639 Classification: A
26: Bovine adenovirus Type 6 ATCC VR-642 Permit: USDA perm
27: Canine adenovirus ATCC VR-800 Classification: Adenovir
28: Bovine adenovirus Type 5 ATCC VR-641 Permit: USDA perm
29: Adenovirus Type 36 ATCC VR-913 Classification: Adenovi
30: Ovine adenovirus type 5 ATCC VR-1343 Classification: A
31: Adenovirus Type 29 ATCC VR-272 Classification: Adenovi
32: Swine adenovirus ATCC VR-359 Classification: Adenoviru
33: Bovine adenovirus Type 4 ATCC VR-640 Permit: USDA perm
34: Bovine adenovirus Type 8 ATCC VR-769 Permit: USDA perm
35: Bovine adenovirus Type 7 ATCC VR-768 Permit: USDA perm
36: Adeno-associated virus Type 2 (AAV-2H) ATCC VR-630 Cla
37: Adenovirus Type 4 ATCC VR-4 Classification: Adenovirus
38: Adeno-associated virus Type 3 (AAV-3H) ATCC VR-681 Cla
39: Peromyscus adenovirus ATCC VR-528 Classification: Aden
40: Adenovirus Type 15 ATCC VR-661 Classification: Adenovi
41: Adenovirus Type 20 ATCC VR-662 Classification: Adenovi
42: Chimpanzee adenovirus ATCC VR-593 Classification: Aden
43: Adenovirus Type 31 ATCC VR-357 Classification: Adenovi
44: Adenovirus Type 25 ATCC VR-223 Classification: Adenovi
45: Chimpanzee adenovirus ATCC VR-392 Classification: Aden
46: Chimpanzee adenovirus ATCC VR-591 Classification: Aden
47: Adenovirus Type 26 ATCC VR-224 Classification: Adenovi
48: Adenovirus Type 19 ATCC VR-254 Classification: Adenovi
49: Adenovirus Type 23 ATCC VR-258 Classification: Adenovi
50: Adenovirus Type 28 ATCC VR-226 Classification: Adenovi
51: Adenovirus Type 6 ATCC VR-6 Classification: Adenovirus
52: Adenovirus Type 2 Antiserum: ATCC VR-1079 AS/Rab (NIA
53: Adenovirus Type 6 ATCC VR-1083 (NIAID V-206-003-014)
54: Ovine adenovirus type 6 ATCC VR-1340 Classification: A
55: Adenovirus Type 3 ATCC VR-847 (Derived from NIAID V-20
56: Adenovirus Type 7 ATCC VR-7 Classification: Adenovirus
57: Adenovirus Type 39 ATCC VR-932 Classification: Adenovi
58: Adenovirus Type 3 ATCC VR-3 Classification: Adenovirus
59: Bovine adenovirus Type 1 ATCC VR-313 Classification: A
60: Adenovirus Type 14 ATCC VR-15 Classification: Adenovir
61: Adenovirus Type 1 ATCC VR-1078 (NIAID V-201-001-014)
62: Adenovirus Type 21 ATCC VR-256 Classification: Adenovi
63: Adenovirus Type 18 ATCC VR-1095 (NIAID V-218-003-014)
64: Baboon adenovirus ATCC VR-275 Classification: Adenovir
65: Adenovirus Type 10 ATCC VR-11 Classification: Adenovir
66: Adenovirus Type 33 ATCC VR-626 Classification: Adenovi
67: Adenovirus Type 34 ATCC VR-716 Classification: Adenovi
68: Adenovirus Type 15 ATCC VR-16 Classification: Adenovir
69: Adenovirus Type 22 ATCC VR-257 Classification: Adenovi
70: Adenovirus Type 24 ATCC VR-259 Classification: Adenovi
71: Adenovirus Type 17 ATCC VR-1094 (NIAID V-217-003-014)
72: Adenovirus Type 4 ATCC VR-1081 (NIAID V-204-003-014)
73: Adenovirus Type 16 ATCC VR-17 Classification: Adenovir
74: Adenovirus Type 17 ATCC VR-18 Classification: Adenovir
75: Adenovirus Type 16 ATCC VR-1093 (NIAID V-216-003-014)
76: Bovine adenovirus Type 2 ATCC VR-314 Classification: A
77: SV-30 ATCC VR-201 Classification: Adenovirus, Simian (
78: Adenovirus Type 32 ATCC VR-625 Classification: Adenovi
79: Adenovirus Type 20 ATCC VR-233 Classification: Adenovi
80: Adenovirus Type 13 ATCC VR-14 Classification: Adenovir
81: Adenovirus Type 14 ATCC VR-1091 (NIAID V-214-001-014)
82: Adenovirus Type 18 ATCC VR-19 Classification: Adenovir
83: SV-39 ATCC VR-353 Classification: Adenovirus, Simian (
84: Adenovirus Type 11 ATCC VR-849 (Derived from NIAID V-2
85: Duck adenovirus (Egg drop syndrome) ATCC VR-921 Permi
86: Adenovirus Type 1 ATCC VR-1 Classification: Adenovirus
87: Chimpanzee adenovirus ATCC VR-594 Classification: Aden
88: Adenovirus Type 15 ATCC VR-1092 (NIAID V-215-003-014)
89: Adenovirus Type 13 ATCC VR-1090 (NIAID V-213-003-014)
90: Adenovirus Type 3 ATCC VR-1368 (Derived from NIAID V-20
91: SV-31 ATCC VR-204 Classification: Adenovirus, Simian (
92: Adenovirus Type 9 ATCC VR-1086 (NIAID V-209-003-014)
93: Mouse adenovirus ATCC VR-550 Classification: Adenoviru
94: Adenovirus Type 9 ATCC VR-10 Classification: Adenoviru
95: Adenovirus Type 41 ATCC VR-930 Classification: Adenovi
96: C1 ATCC VR-20 Classification: Adenovirus, Simian (Mast
97: Adenovirus Type 40 ATCC VR-931 Classification: Adenovi
98: Adenovirus Type 37 ATCC VR-929 Classification: Adenovi
99: Marble spleen disease virus (Hemorrhagic enteritis virus
100: Adenovirus Type 35 ATCC VR-718 Classification: Adenovi
101: SV-32 (M3) ATCC VR-205 Classification: Adenovirus, Sim
102: Adenovirus Type 28 ATCC VR-1106 (NIAID V-228-003-014)
103: Adenovirus Type 10 ATCC VR-1087 (NIAID V-210-003-014)
104: Adenovirus Type 20 ATCC VR-1097 (NIAID V-220-003-014)
105: Adenovirus Type 21 ATCC VR-1098 (NIAID V-221-011-014)
106: Adenovirus Type 25 ATCC VR-1103 (NIAID V-225-003-014)
107: Adenovirus Type 26 ATCC VR-1104 (NIAID V-226-003-014)
108: Adenovirus Type 31 ATCC VR-1109 (NIAID V-231-001-014)
109: Adenovirus Type 19 ATCC VR-1096 (NIAID V-219-002-014)
110: SV-36 ATCC VR-208 Classification: Adenovirus, Simian (
111: SV-38 ATCC VR-355 Classification: Adenovirus, Simian (
112: SV-25 (M8) ATCC VR-201 Classification: Adenovirus, Sim
113: SV-15 (M4) ATCC VR-197 Classification: Adenovirus, Sim
114: Adenovirus Type 22 ATCC VR-1100 (NIAID V-222-003-014)
115: SV-23 (M2) ATCC VR-200 Classification: Adenovirus, Sim
116: Adenovirus Type 11 ATCC VR-12 Classification: Adenovir
117: Adenovirus Type 24 ATCC VR-1102 (NIAID V-224-003-014)
118: Avian adenovirus Type 1 (Chicken Embryo Lethal Orphan: C
119: SV-11 (M5) ATCC VR-196 Classification: Adenovirus, Sim
120: Adenovirus Type 5 ATCC VR-5 Classification: Adenovirus
121: Adenovirus Type 23 ATCC VR-1101 (NIAID V-223-003-014)
122: SV-27 (M9) ATCC VR-202 Classification: Adenovirus, Sim
123: Avian adenovirus Type 2 (GAL) ATCC VR-280 Classificati
124: SV-1 (M1) ATCC VR-195 Classification: Adenovirus, Simi
125: SV-17 (M6) ATCC VR-198 Classification: Adenovirus, Sim
126: Adenovirus Type 29 ATCC VR-1107 (NIAID V-229-003-014)
127: Adenovirus Type 2 ATCC VR-846 Classification: Adenovir
128: SV-34 ATCC VR-207 Classification: Adenovirus, Simian (
129: SV-20 (M7) ATCC VR-199 Classification: Adenovirus, Sim
130: SV-37 ATCC VR-209 Classification: Adenovirus, Simian (
131: SV-33 (M10) ATCC VR-206 Classification: Adenovirus, Si
132: Avian adeno-associated virus ATCC VR-865 Classificatio
133: Adeno-associated (satellite) virus Type 4 ATCC VR-646
134: Adenovirus Type 30 ATCC VR-273 Classification: Adenovi
135: Adeno-associated (satellite) virus Type 1 ATCC VR-645
136: Infectious canine hepatitis (Rubarth's disease, Fox ence
137: Adenovirus Type 27 ATCC VR-1105 (NIAID V-227-003-014)
138: Adenovirus Type 12 ATCC VR-863 (Derived from NIAID V-2
139: Adeno-associated virus Type 2 (molecularly cloned) ATCC
140: Adenovirus Type 7a ATCC VR-848 (Derived from NIAID V-2
What is claimed:

1. A first generation recombinant adenovirus vector a portion of which integrates
into a host cell genome, comprising:

a) A left adenovirus inverted terminal repeat sequence;

b) An adenoviral packaging sequence 3' to the left adenovirus inverted
terminal repeat sequence;

c) A transgene cassette sequence 3' to the adenoviral packaging sequence;

d) At least one adenoviral sequence which directs adenoviral replication; and

e) A right adenoviral inverted terminal repeat sequence,

wherein the left and right terminal repeat sequences permit integration of the
transgene cassette sequence into the host cell genome.

2. The adenovirus vector of claim 1, wherein the left and right adenovirus inverted
repeat sequence and the packaging sequence are from the same adenoviral
serotype.

3. The adenovirus vector of claim 1, wherein the sequence which directs adenoviral
replication comprises a sequence on the anti-parallel strand which encodes an
adenoviral fiber protein including a fiber tail, a fiber shaft, and a fiber knob,
wherein the fiber knob includes a G-H loop region.

4. The adenovirus vector of claim 3, wherein the sequence on the anti-sense strand
which encodes the fiber tail is from the same serotype as the adenoviral inverted
repeat sequence.

5. The adenovirus vector of claim 1, wherein the transgene cassette sequence
comprises:

a) A left cassette inverted terminal repeat sequence;

b) A polyadenylation sequence 3' to the left inverted terminal repeat
sequence;

c) A transgene sequence 3' to the polyadenylation sequence;

d) A promoter sequence 3' to the polyadenylation sequence; and

e) A right cassette inverted terminal repeat sequence 3' to the polyadenylation
sequence.

6. The adenovirus vector of claim 1, wherein the transgene cassette sequence
comprises:

a) A left cassette inverted terminal repeat sequence;

b) A promoter sequence 3' to the left inverted terminal repeat sequence;

c) A transgene sequence 3' to the promoter sequence;

d) A polyadenylation sequence 3' to the transgene sequence; and

e) A right cassette inverted terminal repeat sequence 3' to the polyadenylation
sequence.


7. The adenovirus vector of claim 5 or 6, wherein the left and the right cassette
inverted terminal repeat sequences each comprise an adenoviral-associated
inverted terminal repeat sequence.

8. The adenovirus vector of claim 5 or 6, wherein the transgene sequence is selected
from a group consisting of a therapeutic gene, a selectable gene, and a reporter
gene.

9. The adenovirus vector of claim 8, wherein the therapeutic gene is selected from a
group consisting of gamma globin, and human alpha-1 anti-trypsin.

10. The adenovirus vector of claim 8, wherein the selectable gene is selected from a
group consisting of neomycin, ampicillin, penicillin, tetracycline, and gentamycin.

11. The adenovirus vector of claim 8, wherein the reporter gene is selected from a
group consisting of green fluorescent protein, beta galactosidase, alkaline
phosphatase.
12. The transgene cassette of claim 5 or 6 further comprising an inverted repeat
sequence located 3' to the left inverted terminal repeat sequence or located 5' to
the right inverted terminal repeat sequence.

13. The transgene cassette of claim 5 or 6, further comprising an insulator element.

14. The transgene cassette of claim 5 or 6, further comprising a bacterial origin of
replication.

15. The adenoviral vector of claim 1, wherein the adenoviral sequences which direct
adenoviral replication are selected from a group consisting of E2 and E4; E1, E2
and E4; E2 and E4; and E2, E3, and E4.

16. A first generation recombinant adenovirus vector which targets a host cell of 
interest and a portion of which integrates into the host cell genome so targeted, 
comprising two DNA strands, each strand being antiparallel to the other, the first 
strand comprising: 

a) A left adenovirus inverted terminal repeat sequence; 

b) An adenoviral packaging sequence 3' to the left adenovirus inverted 
terminal repeat sequence; 

c) A transgene cassette sequence 3' to the adenoviral packaging sequence; 

d) At least one adenoviral sequence which directs adenoviral replication; and 

e) A right adenoviral inverted terminal repeat sequence, 

wherein the left and right terminal repeat sequences permit integration of the 
transgene cassette sequence into the host cell genome, and wherein the second 
strand comprises a sequence which encodes an adenoviral fiber protein that 
permits targeting of the vector into the host cell of interest. 
17. The adenovirus vector of claim 16, wherein the adenoviral protein includes a fiber
tail, a fiber shaft, and a fiber knob, wherein the fiber knob includes a G-H loop 
region. 

18. The adenovirus vector of claim 16, wherein the left and right adenovirus inverted 
terminal repeat sequences and the packaging sequence are from the same 
adenoviral serotype. 

19. The adenovirus vector of claim 17, wherein the fiber tail is from the same 
serotype as the left and right adenoviral inverted repeat sequences. 

20. The adenovirus vector of claim 17, wherein the fiber shaft is from a different
serotype than the left and right adenoviral inverted repeat sequences.

21. The adenovirus vector of claim 20, wherein the fiber shaft is from a serotype 
selected from a group consisting of serotype 3, 7, 9, 11, and 35. 

22. The adenovirus vector of claim 17, wherein the fiber shaft comprises a shortened
length. 

23. The adenovirus vector of claim 17, wherein the fiber knob is from a different
serotype than the left and right adenoviral inverted repeat sequences.

24. The adenovirus vector of claim 23, wherein the fiber knob is from a serotype 
selected from a group consisting of serotype 3, 7, 9, 11, and 35. 

25. The adenovirus vector of claim 17, wherein the fiber knob is a modified fiber 
knob protein comprising the G-H loop replaced with a heterologous peptide 
ligand sequence which binds to at least one surface protein on the host cell of 
interest. 
26. The adenovirus vector of claim 16, wherein the transgene cassette sequence
comprises: 

a) A left cassette inverted terminal repeat sequence; 

b) A polyadenylation sequence 3' to the left inverted terminal repeat 
sequence; 

c) A transgene sequence 3' to the polyadenylation sequence; 

d) A promoter sequence 3' to the polyadenylation sequence; and 

e) A right cassette inverted terminal repeat sequence 3' to the polyadenylation 
sequence. 

27. The adenovirus vector of claim 16, wherein the transgene cassette sequence 
comprises: 

a) A left cassette inverted terminal repeat sequence; 

b) A promoter sequence 3' to the left inverted terminal repeat sequence; 

c) A transgene sequence 3' to the promoter sequence; 

d) A polyadenylation sequence 3' to the transgene sequence; and 

e) A right cassette inverted terminal repeat sequence 3' to the polyadenylation 
sequence. 

27. The adenovirus vector of claim 26 or 27, wherein the left and the right cassette 
inverted terminal repeat sequences each comprise an adenoviral-associated 
inverted terminal repeat sequence. 

28. The adenovirus vector of claim 26 or 27, wherein the transgene sequence is 
selected from a group consisting of a therapeutic gene, a selectable gene, and a 
reporter gene. 

29. The adenovirus vector of claim 24, wherein the therapeutic gene is selected from
a group consisting of gamma globin, and human alpha-1 anti-trypsin.

30. The adenovirus vector of claim 24, wherein the selectable gene is selected from a
group consisting of neomycin, ampicillin, penicillin, tetracycline, and gentamycin.

31. The adenovirus vector of claim 24, wherein the reporter gene is selected from a
group consisting of green fluorescent protein, beta galactosidase, alkaline
phosphatase.

32. The transgene cassette of claim 26 or 27 further comprising an inverted repeat
sequence located 3' to the left inverted terminal repeat sequence or located 5' to
the right inverted terminal repeat sequence.

33. The transgene cassette of claim 26 or 27, further comprising an insulator element.

34. The transgene cassette of claim 26 or 27, further comprising a bacterial origin of
replication.

35. The adenoviral vector of claim 16, wherein the adenoviral sequences which direct
adenoviral replication are selected from a group consisting of E2 and E4; E1, E2
and E4; E2 and E4; and E2, E3, and E4.

36. A recombinant gutless adenovirus vector a portion of which integrates into a host
cell genome, comprising:

a) A left adenovirus inverted terminal repeat sequence;

b) An adenoviral packaging sequence 3' to the left adenovirus inverted
terminal repeat sequence;

c) A transgene cassette sequence 3' to the adenoviral packaging sequence;
and

d) A right adenoviral inverted terminal repeat sequence,

wherein the left and right terminal repeat sequences permit integration of the
transgene cassette sequence into the host cell genome.
37. The adenovirus vector of claim 36, wherein the left and right adenovirus inverted
repeat sequence and the packaging sequence are from the same adenoviral
serotype (this is the base vector).

38. The adenovirus vector of claim 36, wherein the transgene cassette sequence
comprises:

a) A left cassette inverted terminal repeat sequence;

b) A polyadenylation sequence 3' to the left inverted terminal repeat
sequence;

c) A transgene sequence 3' to the polyadenylation sequence;

d) A promoter sequence 3' to the polyadenylation sequence; and

e) A right cassette inverted terminal repeat sequence 3' to the polyadenylation
sequence.

39. The adenovirus vector of claim 36, wherein the transgene cassette sequence
comprises:

a) A left cassette inverted terminal repeat sequence;

b) A promoter sequence 3' to the left inverted terminal repeat sequence;

c) A transgene sequence 3' to the promoter sequence;

d) A polyadenylation sequence 3' to the transgene sequence; and

e) A right cassette inverted terminal repeat sequence 3' to the polyadenylation
sequence.

40. The adenovirus vector of claim 38 or 39, wherein the left and the right inverted
terminal repeat sequences each comprise an adenoviral-associated inverted
terminal repeat sequence.

41. The adenovirus vector of claim 38 or 39, wherein the transgene sequence is
selected from a group consisting of a therapeutic gene, a selectable gene, and a
reporter gene.
42. The adenovirus vector of claim 41, wherein the therapeutic gene is selected from
a group consisting of gamma globin, and human alpha-1 anti-trypsin.

43. The adenovirus vector of claim 41, wherein the selectable gene is selected from a
group consisting of neomycin, ampicillin, penicillin, tetracycline, and gentamycin.

44. The adenovirus vector of claim 41, wherein the reporter gene is selected from a
group consisting of green fluorescent protein, beta galactosidase, alkaline
phosphatase.

45. The adenovirus vector of claim 38 or 39 further comprising an inverted repeat
sequence located 3' to the left inverted terminal repeat sequence or located 5' to
the right inverted terminal repeat sequence.

46. The adenovirus vector of claim 38 or 39, further comprising an insulator element. 

47. The adenovirus vector of claim 38 or 39, further comprising a bacterial origin of 
replication. 

48. The adenovirus vector of claim 1, wherein the transgene cassette sequence 
comprises a 5' portion of a gene of interest. 

49. The adenovirus vector of claim 1, wherein the transgene cassette sequence 
comprises a 3' portion of a gene of interest. 

50. A method of producing a resolved gutless adenovirus vector in a suitable cell, said 
method comprising introducing a first and a second adenovirus vectors of claim 1 
or 16 into the cell under suitable conditions so that the recombinant adenovirus 
vectors undergo homologous recombination thereby producing a resolved gutless 
adenovirus vector. 
51. A resolved gutless adenovirus vector produced by the method of claim 50.

52. The method of claim 50, wherein the first adenovirus vector comprises a
transgene cassette having a 5' portion of a gene of interest, and wherein the
second adenovirus vector comprises a transgene cassette having a 3' portion of
the gene of interest, and wherein a part of the 5' portion overlaps with a part of
the 3' portion so that homologous recombination occurs.

53. A method of producing a resolved gutless recombinant Ad vector by homologous 
recombination in a suitable cell, said method comprising contacting two parental 
recombinant Ad vectors, each comprising a transgene cassette containing a 
portion of a selected transgene with a region of overlapping homology, so that the 
first and second parental recombinant Ad vectors undergo homologous 
recombination at the region of overlapping homology, resulting in a resolved 
recombinant gutless Ad vector having both portions of the selected transgene, and 
wherein the selected transgene is within a transgene cassette flanked by a pair of 
ITRs. 

54. A resolved gutless adenovirus vector produced by the method of claim 53. 

55. An adenovirus library comprising a plurality of adenovirus vectors expressing 
fiber proteins which are displayed and modified by random peptide insertions. 

56. The library of claim 55, wherein said fiber protein so displayed comprises a 
random peptide substituted in the G-H loop of the fiber protein knob domain. 

57. A screening method for targeting adenovirus vectors for gene therapy comprising 
contacting the adenovirus library of claim 55 with a plurality of cells so that the 
cells are transduced with the adenovirus vectors of the adenovirus library, and 
detecting the cells so transduced. 

58. The adenovirus vector of claim 5, 6, 26, 27, 38, or 39 further comprising a 
nucleotide sequence encoding a rep78 protein. 

RECOMBINANT ADENOVIRAL VECTORS EXPRESSING CHIMERIC FIBER PROTEINS FOR CELL SPECIFIC 
INFECTION AND GENOME INTEGRATION 

This invention was made, at least in part, with funding from the National Institutes of 
Health (Grant Nos. R01 CA 80192-01 and R21 DK 55590-01). Accordingly, the United 
States Government has certain rights in this invention. 

FIELD OF THE INVENTION 

This invention relates to the field of gene therapy, and in particular, to novel adenovirus 
(Ad) vectors that selectively infect cells for gene therapy, and to Ad vectors containing 
modifications of the fiber protein to allow retargeting of any adenovirus serotype. 

BACKGROUND OF THE INVENTION 

Gene transfer vectors require the efficient transduction of target cells, stable association 
with the host genome, and adequate transgene expression in the appropriate target cell, 
without associated toxic or immunological side effects. Currently available viral vector 
systems, including recombinant retroviruses, adenoviruses and adeno-associated viruses, 
are not suitable for efficient gene transfer into many cell types. Retroviral vectors require 
cell division for stable integration. Recombinant adenoviruses are not able to infect many 
cell types important for gene therapy, including hematopoietic stem cells, monocytes, T- 
and B-lymphocytes. Moreover, recombinant adeno-associated vectors (AAV) integrate 
with a low frequency. 

First generation adenoviruses have a number of properties that make them an attractive 
vehicle for gene transfer (Hitt, M.M. et al. 1997 Advances in Pharmacology 40:137-205). 
These include the ability to produce purified virus at high titers in concert with highly 
efficient gene transfer of up to 8 kb long expression cassettes into a large variety of cell 
types in vivo, including non-dividing cells. Limitations of first generation adenoviruses 
include the development of immune responses to expressed viral proteins resulting in 
toxicity and virus clearance. The episomal status of adenoviral DNA within transduced 
cells is another limitation of first generation Ad vectors. Stable integration of adenovirus 
DNA into the host genome is reported only for wild-type forms of specific subtypes and 
appears not to occur in a detectable manner with E1/E3-deleted Ad 5 (adenovirus 
serotype 5) vectors widely used for gene transfer in vitro and in vivo (Hitt, M.M. et al. 
1997 Advances in Pharmacology 40:137-205). 

Recombinant AAV vectors (rAAV) integrate with a low frequency (about 1 out of 20,000 
genomes) randomly as concatemers into the host genome (Rutledge, E.A.; Russell, D.W. 
1997 J. Virology, 71, 8429-8436). The presence of two AAV inverted terminal repeats 
(ITRs) and as yet unknown host cellular factors seem to be the only requirement for 
vector integration (Xiao, X., et al. 1997, J. Virology, 71, 941-948; Balague, C., et al. 
1997, J. Virology, 71, 3299-3306; Yang, C.C. 1997, J. Virology, 71, 9231-9247). In the 
presence of the large AAV Rep proteins, AAV integrates preferentially into a specific site 
at human chromosome 19, called AAVS1 (Berns, K.I., 1996, Fields Virology, Fields, 
B.N. et al. (ed) Vol. 2, Lippincott-Raven, Philadelphia, PA, 2173-2220). The AAV 
capsid is formed by three coat proteins (VP 1-3), which interact with specific heparin 
sulfates on the cell surface and probably with specific receptor(s). However, many cell 
types, including hematopoietic stem cells, lack these structures so that rAAV vectors 
based on AAV2 cannot infect or transduce these cells (Malik P. et al., 1997, J. Virology, 
71, 1776-1783; Quing, K.Y., et al. 1998, J. Virology, 72, 1593-1599). Other 
disadvantages of rAAV vectors include the limited insert size (4.5-5kb) that can be 
accommodated in rAAV vectors lacking all viral genes and low transducing titers of 
rAAV preparations. 

Adenovirus infection is initiated by attachment of Ad 5 to the cell surface via its fiber 
protein (for a review, see Shenk, T. 1996 Fields Virology, Fields, B.N. et al. (ed) 
Vol. 2, Lippincott-Raven, Philadelphia, PA, 2111-2148). The distal, C-terminal domain 
of the trimeric fiber molecule terminates in a knob, which binds to a specific cellular 
receptor identified recently as the coxsackie-adenovirus receptor (CAR) (Bergelson, J.M. 
et al. Science, 275, 1320-1323). After binding, in an event independent of virus 
attachment, Arg-Gly-Asp (RGD) motifs in the penton base interact with cellular integrins 
of the αvβ3 and αvβ5 types. This interaction triggers cellular internalization whereby the 
virion achieves localization within the endosome. The endosomal membrane is lysed in a 
process mediated by the penton base, releasing the contents of the endosome to the 
cytoplasm. During these processes, the virion is gradually uncoated and the adenoviral 
DNA is transported to the nucleus where replication takes place. The terminal protein, 
which is covalently attached to the viral genome, and the core protein V, which is 
localized on the surface of the cores, have nuclear localization signals (NLSs) (van der 
Vliet, B. 1995, The Molecular Repertoire of Adenoviruses, Vol 2, Doerfler, W. and Boehm, 
P. (ed.), Springer Verlag, Berlin, 1-31). These NLSs play a crucial role in directing the 
adenoviral genome to the nucleus and probably represent the structural elements which 
allow adenovirus to transduce non-dividing cells. When the double-stranded, linear DNA 
reaches the nucleus, it binds to the nuclear matrix through its terminal protein. 

Since the cell types that can be infected with Ad5 or Ad2 vectors are restricted by the 
presence of CAR and specific integrins, attempts have been made to widen the tropism of 
Ad vectors. Genetic modification of adenovirus coat proteins to target novel cell surface 
receptors has been reported for the fiber (Krasnykh, V. et al. 1998 J. Virology, 72, 1844- 
1852; Krasnykh, V. et al. 1996 J. Virology, 70, 6839-6846; Stevenson, S.D., et al. 1997, 
J. Virology, 71, 4782-4790), penton base (Wickham, T.J., et al. 1996, J. Virology, 70, 
6831-6838; Wickham, T.J., et al. 1995, Gene Therapy, 69, 750-756), and hexon proteins 
(Crompton, J., et al. 1994, J. Gen. Virol. 75, 133-139). The most promising modification 
seems to be the functional modification of the fiber protein or, more specifically, of the 
fiber knob as the moiety that mediates the primary attachment. Two groups have 
reported the generation of fibers consisting of the Ad5 tail/shaft and the knob domain of 
Ad3 (Krasnykh, V. et al. 1996 supra; Stevenson, S.D., et al. 1997, supra). Recently, 
recombinant adenoviruses with fibers containing C-terminal poly-lysine, gastrin-releasing 
peptide, somatostatin, E-selectin-binding peptide, or oligo-histidines were produced in 
order to change the native tropism of Ad5. Krasnykh et al. found (Krasnykh, V. et al. 
1998 supra) that heterologous peptide ligands could be inserted into the HI loop of the 
fiber knob domain without affecting the biological function of the fiber. Based on studies 
with other Ad serotypes, it appears that the length of the fiber shaft is a critical element, 
determining the efficiency of interaction with cell surface integrins and the internalization 
process. Thus far, there are no reported data demonstrating successful retargeting of Ad5 
vectors for a specific cell type. 

Therefore, there is a present need for an improved adenovirus vector which can be 
targeted efficiently to a variety of cell types and tissues and remain stably integrated in 
the host genome with minimal antigenicity to the host. The present invention discloses 
novel chimeric adenoviral (Ad) Ad-AAV vectors, which express a modified fiber protein 
on their capsid, for specifically targeting the vector. Methods of making, uses and 
advantages of these vectors are described. In addition, the alterations described for the 
knob and shaft domains of the fiber protein provide a novel approach to retarget any 
adenovirus serotype for cell specific infection. 

SUMMARY OF THE INVENTION 

The present invention provides for novel chimeric Ad-vectors carrying transgenes, or 
portions of transgenes, for stable and efficient gene transfer into diverse cell types or 
tissues in a CAR- and/or αvβ3/5-independent manner. Also provided are methods for 
producing such vectors and the use thereof for gene therapy to target a specific cell type 
or tissue. 

The recombinant adenovirus vectors of the invention (Example I) provide a novel design 
that allows for the easy production and delivery of a "gutless" adenoviral vector with the 
added advantage of stable integration of the transgene into the host genome of different 
cell types. The adenoviral vector described is devoid of all adenoviral sequences except 
for the 5' and 3' cis elements necessary for replication and virion encapsidation. The 
adeno-associated virus sequences of the invention comprising the 5' (right) and 3' 
(left) inverted terminal repeats (ITRs) flank the transgene cassette such that they 
direct homologous recombination during viral replication and viral integration into the 
host genome. In one embodiment AAV-ITR flanking sequences are used. The vector 
also contains a selected transgene(s) operably linked to a selected regulatory element and 
a polyadenylation stop signal, which is in turn flanked by the flanking sequences described 
above. The selected transgene(s) can be linked under the same regulatory elements or 
under separate regulatory elements in the same orientation or in opposite orientations 
with respect to each other. The selected transgene(s) are any gene or genes which are 
expressed in a host cell or tissue for therapeutic, reporter or selection purposes. This 
vector is characterized by high titer transgene delivery to a host cell and the ability to 
stably integrate the transgene into the host genome. Also provided is a method to 
improve the integration frequency and site specific integration by incorporating an AAV 
rep protein into the recombinant hybrid vector. 

The invention also provides chimeric fiber proteins (Example II), which include 
naturally occurring fiber proteins in which a portion or portions of the sequence are 
modified to alter cell or tissue specificity of infection. Altered fiber protein sequences 
can include fiber protein domains (the knob domain, the shaft domain, and the tail 
domain) from other or the same adenovirus serotypes or from randomly selected 
peptides. A chimeric fiber protein can be entirely composed of non-naturally occurring 
sequences. The invention further relates to nucleic acid sequences encoding the chimeric 
fiber proteins. These nucleic acid sequences can be naturally occurring, a mixture of 
naturally occurring and non-naturally occurring sequences, or entirely non-naturally 
occurring sequences. 

The heterologous fiber protein sequences described herein can be inserted into any 
adenovirus based vector which contains a capsid, rendering the virus capable of 
specifically infecting a given cell or tissue. Adenoviral vectors having such a 
heterologous fiber sequence can be used to direct gene transfer into desired cells. For 
stable integration of the transgene cassette into the host genome, the chimeric Ad-AAV 
vector described in the invention is the preferred vector of use. 

The invention also includes a library of adenoviruses displaying random peptides in their 
fiber knobs, which can be used as ligands to screen for an adenovirus variant with tropism 
to a particular cell type in vitro and in vivo. 

The chimeric Ad-vectors described herein include the Ad.AAV genome with a modified 
fiber protein expressed on its capsid. These chimeric vectors are designed to infect a 
wide variety of cells, in particular, the cells which can only be poorly transduced by the 
commonly used retroviral, AAV and adenoviral vectors. These cells include, but are not 
limited to, hematopoietic stem cells, lung epithelial cells, dendritic cells, lymphoblastoid 
cells, and endothelial cells. Hematopoietic stem cells such as CD34+ cells can be targeted 
for gene therapy of sickle cell anemia and thalassemia using the vector described herein. 
The chimeric Ad-AAV vector capable of transducing genes into endothelial cells can be 
used in gene therapy for vascular diseases such as atherosclerosis or restenosis after 
coronary artery surgery. 

BRIEF DESCRIPTION OF THE FIGURES 

Figs. 1A-1C display a proposed mechanism for forming the ΔAd.AAV1 genome. 

Figs. 2A and 2B show electron photomicrographs of hybrid virus particles: Fig. 2A 
shows Ad.AAV1 and Fig. 2B shows ΔAd.AAV1. 

Fig. 3 illustrates analysis of ΔAd.AAV1 genomes after transduction of SKHep1 cells by 
pulse field gel electrophoresis (PFGE). 1x10^6 control SKHep1 cells (SKHep1) (lanes 1-3, 
5, 9), SKHep1 cells from G418 resistant pools (ΔAd.AAV1) (infected with ΔAd.AAV1 
and selected for 4 weeks) (lanes 6-8, 10-12), or SKHep1 cells collected at 3 days after 
infection with 2000 genomes Ad.AAV1 (Ad) (lanes 4, 13) are sealed in agarose plaques, 
lysed in situ and subjected to PFGE with or without prior digestion with restriction 
endonucleases. Southern blot is performed with a SEAP specific probe. U = undigested, 
P = digested with PI-SceI, I = I-CeuI, E = EcoRI. 

Figs. 4A and 4B show response of K562 and CD34+ cells respectively after infection 
with ΔAd.AAVBG. Cells are incubated for 6 hours with virus under agitation. At day 3 
after infection, transduction frequency is calculated based on the number of X-Gal 
positive cells. Viability is tested by trypan blue exclusion. N = 3, SEM < 10%. 

Fig. 5 shows Rep expression in SKHep1 and 293 cells after plasmid transfection. 5x10^5 
cells are transfected with pAAV/Ad, pRSVrep, or pPGKrep by Ca-phosphate 
co-precipitation. Three days after transfection, cells are harvested. Lysates are separated 
on a 10% PA gel, followed by Western blot with Rep specific antibodies (03-65169, 
American Research Products), and developed with ECL (Amersham). 

Fig. 6 shows detection of vector integration into AAVS1 by PFGE. 

Fig. 7 shows strategy for creating a ΔAd.AAV hybrid vector capable of site-specific 
integration. Arrows indicate promoters, (PA) = polyadenylation signal, Ψ = adenoviral 
packaging signal. 

Figs. 8A-8B show vectors for transduction studies with SNori as expression unit and 
analysis of vector integration on genomic DNA from a small cell number. Analogous 
vector sets can be generated with β-galactosidase (BG) or green fluorescence protein 
(GFP) as reporter genes. 

Fig. 9 shows strategy for substituting the Ad5 fiber sequence by the heterologous fiber X 
genes using recombination in E. coli. 

Fig. 10 shows the expression of CAR and αv-integrins on test cells. For flow cytometry 
analysis, HeLa, CHO, K562, and CD34+ cells were incubated with monoclonal anti-CAR 
(RmcB, 1:400 dilution) or anti-αv-integrin antibodies (L230, 1:30 dilution). As a negative 
control, cells were incubated with an irrelevant mouse monoclonal antibody (anti-BrdU, 
1:100 dilution). The binding of primary antibodies was developed with anti-mouse IgG- 
FITC labeled conjugates (1:100 dilution). Data shown represent the average results of 
quadruplicate analyses performed on 10^4 cells. 

Fig. 11 shows the electron microscopy of adenovirus particles. Purified particles from 
Ad5, 9, and 35 were negative contrast stained and analyzed at a magnification of 
85,000x. Defective particles are highlighted by arrows. 

Fig. 12 shows the analysis of attachment and internalization of different serotypes to 
CHO, HeLa, K562, and CD34+ cells. Equal amounts of [3H]-thymidine-labeled virions 
of Ads 3, 4, 5, 9, 35, and 41 (measured by OD260, and equivalent to an MOI of 400 pfu 
per cell for Ad5) were incubated for one hour on ice as described in Materials and 
Methods. Cells were then washed, and the number of labeled virions bound per cell was 
determined. For internalization studies, viruses were first allowed to attach to cells for 1 h 
on ice. Then, unbound viral particles were washed out. Cells were then incubated at 37°C 
for 30 min followed by treatment with trypsin-EDTA and washing to remove 
uninternalized viral particles. The data were obtained from two to four independent 
experiments performed in triplicate. Note the different scale on the Y-axes for CD34+ 
cells. 

Figs. 13A-13C show attachment and internalization of different adenovirus serotypes to 
HeLa, CHO and 293 cells respectively. 

Figs. 14A and 14B show attachment and internalization of different adenovirus serotypes 
to CD34+ and K-562 cells respectively. 

Figs. 15A-15C show the analysis of viral replication in K562 and CD34+ cells by 
Southern blot analysis of methylated viral DNA. Replication studies were performed 
with 1x10^5 K562 cells (A) or CD34+ cells (B), infected with methylated Ad5, Ad9 or 
Ad35. The lane labeled as "load" represents DNA that was extracted from the media/cell 
mixture immediately after adding the indicated viral dose to cells. The intensities of 
bands corresponding to methylated and un-methylated viral DNA indicate that ~85% of 
the input virus was methylated. To quantify adsorption and internalization, DNA analysis 
was performed after prior incubation of virus with cells at 0°C (adsorption) or 37°C 
(internalization). For dose dependent replication studies, the indicated viral dose 
(expressed as the number of genomes) was added to the cells, and cellular genomic DNA 
together with viral DNA was extracted 16 hours or 36 hours post-infection for K562 and 
CD34+ cells, respectively. Identical amounts of sample DNA were analyzed by Southern 
blot. For quantification purposes, Ad9 replication was analyzed together with Ad5 using 
an Ad5/9 chimeric probe that hybridizes with DNA of both serotypes (C). The analysis of 
Ad5 versus Ad35 replication was performed with the corresponding Ad5/35 chimeric 
probe. Since separate hybridizations with both Ad5/35 and Ad5/9 probes gave identical 
signal intensities for Ad5 DNA, only one panel is shown for Ad5 replication in test cells. 
To produce distinguishable fragments specific for the methylated or non-methylated 
status of viral genomes, Ad5 DNA was digested with Xho I, while Ad9 and Ad35 DNA 
was digested with Xho I and Hind III. The bands specific for methylated (not-replicated) 
viral DNA were ~12kb for Ad9, 35kb for Ad5, and ~12kb for Ad35. The fragments 
specific for non-methylated DNA were 5.8kb for Ad9, 6.1kb for Ad5, and 9.5kb for 
Ad35. Chimeric Ad5/9 and Ad5/35 DNA fragments (1.8kb) were used as quantification 
standards and applied onto gel together with digested viral/cellular DNA (shown on the 
left part of the figures). 
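
The band assignments given for Fig. 15 can be captured in a small lookup, shown here as a 
minimal Python sketch. The fragment sizes are those quoted in the figure description; the 
function names, the size tolerance, and the idea of estimating a replicated fraction 
directly from two band intensities are illustrative assumptions and are not part of the 
described method. In particular, the sketch assumes both band signals are proportional to 
genome copy number.

```python
# Diagnostic fragment sizes (kb) quoted in the Fig. 15 description.
DIAGNOSTIC_BANDS_KB = {
    "Ad5": {"methylated": 35.0, "non_methylated": 6.1},
    "Ad9": {"methylated": 12.0, "non_methylated": 5.8},
    "Ad35": {"methylated": 12.0, "non_methylated": 9.5},
}


def classify_band(serotype, size_kb, tolerance_kb=0.5):
    """Return 'methylated', 'non_methylated', or None for an observed band.

    tolerance_kb is a hypothetical sizing tolerance, not a value from the text.
    """
    for status, expected_kb in DIAGNOSTIC_BANDS_KB[serotype].items():
        if abs(size_kb - expected_kb) <= tolerance_kb:
            return status
    return None


def replicated_fraction(methylated_signal, non_methylated_signal):
    """Fraction of signal in the non-methylated (replicated) band.

    Assumes signal is proportional to genome copies for both bands.
    """
    total = methylated_signal + non_methylated_signal
    if total <= 0:
        raise ValueError("band intensities must sum to a positive value")
    return non_methylated_signal / total
```

For example, `classify_band("Ad9", 5.8)` identifies the replicated Ad9 fragment, and a 
band that matches neither diagnostic size returns `None`.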

Figs. 16A-16B show the structure of Ad5GFP and chimeric Ad5GFP/F35 vectors. A) 
Schematic diagram of the original E1/E3 deleted Ad5-based vector with GFP-expression 
cassette inserted into the E3 region (Ad5GFP) and the chimeric vector Ad5GFP/F35 
containing the Ad5/35 fiber gene. The 2.2kb Ad5 fiber gene was replaced by a 0.9kb 
chimeric fiber gene encoding for the short shaft and knob of Ad35 by a technique that 
involved PCR-cloning and recombination in E. coli. Kpn I (K) and Hind III (H) sites 
localized within or around the fiber genes are indicated. The lower panel shows the 
detailed structure of the chimeric fiber region. The Ad5 fiber tail [amino acids (aa): 1-44] 
was joined in frame to the Ad35 fiber shaft starting from its first two amino acids (GV), 
which are conserved among many serotypes. A conserved stretch of amino acids TLWT 
marks the boundary between the last β-sheet of Ad35 shaft and the globular knob. The 
Ad35 fiber chain termination codon is followed by the Ad5 fiber poly-adenylation signal. 
The region of Ad5GFP/F35 encoding for chimeric fiber was completely sequenced with 
Ad5 specific primers (see Material and Methods). B) Restriction analysis of viral 
genomes. Viral DNA was isolated from purified Ad5GFP and Ad5GFP/F35 particles as 
described elsewhere. One microgram of DNA was digested with Hind III or Kpn I and 
separated in ethidium bromide stained agarose gels (left panel) which were subsequently 
blotted and analyzed by Southern blot with an Ad5 E4 specific probe (nt 32,7775-33,651) 
(right panel). Specific patterns, designating the correct structure for both viral vectors, 
were detected. The Hind III fragments specific for Ad5GFP and Ad5GFP/F35 were 2.9kb 
and 4.9kb, respectively. The Kpn I fragment that confirmed the correct Ad5GFP/F35 
structure was 1.6kb compared to a 7.6kb Ad5GFP fragment. M - 1kb ladder (Gibco-BRL, 
Grand Island, NY). 

Fig. 17 shows the generation of ΔAd.AAV genomes by recombination between 
inverted homology regions. Recombination between two inverted repeats (IR) present in 
one Ad.AAV vector. The first-generation Ad.AAV vector (~34kb) contains two 1.2kb 
inverted homology elements flanking the transgene cassette. One AAV-ITR is inserted 
between the Ad packaging signal (Ψ) and the left IR. During Ad replication, 
recombination between the IRs mediates the formation of the ΔAd.AAV genomes with the 
transgene flanked by IRs, AAV ITRs, Ad packaging signals, and Ad ITRs. These 
genomes are efficiently packaged into Ad capsids. 

Fig. 18 shows the structure of Ad5/11, Ad5/35. Schematic diagram of the original E1/E3 
deleted Ad5-based vector with GFP-expression cassette inserted into the E3 region 
(Ad5GFP) and the chimeric vector Ad5GFP/F35 containing the Ad5/35 fiber gene. The 
2.2kb Ad5 fiber gene was replaced by a 0.9kb chimeric fiber gene encoding for the short 
shaft and knob of Ad35 by a technique that involved PCR-cloning and recombination in 
E. coli. Kpn I (K) and Hind III (H) sites localized within or around the fiber genes are 
indicated. The lower panel shows the detailed structure of the chimeric fiber region. The 
Ad5 fiber tail [amino acids (aa): 1-44] was joined in frame to the Ad35 fiber shaft 
starting from its first two amino acids (GV), which are conserved among many serotypes. 
A conserved stretch of amino acids TLWT marks the boundary between the last β-sheet 
of Ad35 shaft and the globular knob. The Ad35 fiber chain termination codon is 
followed by the Ad5 fiber poly-adenylation signal. 

Fig. 19 shows the cross-competition for attachment and internalization of labeled 
Ad5GFP, Ad35, and chimeric Ad5GFP/F35 virions with unlabeled viruses, and with anti- 
CAR or anti-αv-integrin MAb. (A) For attachment studies, 10^5 K562 cells were pre- 
incubated with a 100-fold excess of unlabeled competitor virus at 4°C for 1 h. Then, 
equal amounts of [3H]-labeled viruses, at a dose equivalent to an MOI of 100 pfu per cell 
determined for Ad5GFP, were added to cells followed by incubation at 4°C for 1 h. Cells 
were then washed with ice-cold PBS, pelleted and the percentage of attached virus (cell- 
associated counts per minute) was determined. For analysis of cross-competition for 
internalization, cells were pre-incubated with a 100-fold excess of competitor virus at 
37°C for 30 min before labeled virus was added. After an additional incubation at 37°C 
for 30 min, cells were treated with trypsin-EDTA for 5 min at 37°C, washed with ice-cold 
PBS, pelleted, and the percentage of internalized virus was determined. For controls, cells 
were incubated with labeled viruses without any competitors. Preliminary experiments 
had shown that the conditions chosen for competition studies allowed for saturation in 
attachment/internalization on K562 cells for all unlabeled competitors. (B) 10^5 K562 cells 
were pre-incubated for 1 hour at 4°C with anti-CAR MAb (RmcB, diluted 1:100) or with 
anti-αv-integrin MAb (L230, diluted 1:30), followed by incubation with labeled viruses 
according to the protocols for attachment or for internalization as described above. For 
each particular serotype, the percentage of attached/internalized virus was compared to 
the control settings, where cells were preincubated under the same conditions with a 
1:100 dilution of an irrelevant antibody (anti-BrdU MAb) before addition of the labeled 
virus. Note that the specific competitors but not the corresponding controls significantly 
inhibited Ad5 internalization to a degree that is in agreement with published data (59). 
N ≥ 4. 

Fig. 20. Cross-competition for attachment and internalization of [3H]-labeled Ad5GFP, 
Ad35, and chimeric Ad5GFP/F35 virions with unlabeled Ad3 virus (A), and of [3H]- 
labeled Ad3 virions with unlabeled viruses (B). 10^5 K562 cells were pre-incubated with 
a 100-fold excess of unlabeled viral particles according to attachment or internalization 
protocols described for Fig. 6. Equal amounts of [3H]-labeled Ad5GFP, Ad5GFP/F35, or 
Ad35 (A) or [3H]-labeled Ad3 (B) were added to cells at a dose equivalent to an MOI of 
100 pfu per cell for Ad5GFP. In control settings, cells were incubated with labeled 
viruses without any competitors. N=4. 

Fig. 21 shows the transduction of CD34+, K562, and HeLa cells with Ad5GFP and 
chimeric Ad5GFP/F35 vectors. 1x10^5 cells were infected with different MOIs (pfu/cell) 
of viruses in 100 µl of media for 6 hours at 37°C. Virus containing media was then 
removed, and the cells were resuspended in fresh media followed by incubation for 18 h 
at 37°C. The percentage of GFP expressing cells was determined by flow cytometry. N=3. 
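
The relation between MOI, cell number, and the amount of virus added in an infection such 
as the one described for Fig. 21 is simple arithmetic, sketched below in Python. The 
function names are hypothetical, and the MOI value used in the usage note is only an 
example; the figure description itself states the cell number (1x10^5) and infection 
volume (100 µl) but not the individual MOIs tested.

```python
def pfu_required(cell_count, moi):
    """Total plaque-forming units needed to infect cell_count cells at moi (pfu/cell)."""
    if cell_count < 0 or moi < 0:
        raise ValueError("cell_count and moi must be non-negative")
    return cell_count * moi


def pfu_per_ml(cell_count, moi, volume_ul):
    """Virus concentration (pfu/ml) in the infection mix for a volume given in microliters."""
    if volume_ul <= 0:
        raise ValueError("volume_ul must be positive")
    # 1 ml = 1000 microliters; multiply first to keep the arithmetic exact for whole numbers.
    return pfu_required(cell_count, moi) * 1000 / volume_ul
```

As a usage example under these assumptions, infecting 1x10^5 cells at an example MOI of 
200 pfu/cell calls for `pfu_required(100_000, 200)` pfu in total, and 
`pfu_per_ml(100_000, 200, 100)` gives the corresponding concentration in a 100 µl 
infection volume.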

Fig. 22 shows the distribution of GFP-positive cells in subpopulations of human CD34+ 
cells expressing CAR or αv-integrins. 1x10^5 CD34+ cells were infected with Ad5GFP or 
Ad5GFP/F35 at an MOI of 200 pfu/cell as described for Fig. 8. Twenty-four hours after 
infection, cells were incubated with anti-CAR (1:100 final dilution) or anti-αv-integrin 
(1:30 final dilution) primary MAbs for 1 h at 37°C. Binding of primary antibodies was 
developed with anti-mouse IgG-PE labeled secondary MAbs (1:100 final dilution) at 4°C 
for 30 min. For each variant, 10^4 cells were analyzed by flow cytometry. The mock 
infection variants represent cells incubated with virus dilution buffer only. The quadrant 
borders were set based on the background signals obtained with both the GFP- and PE- 
matched negative controls. The percentages of stained cells found in each quadrant are 
indicated. The data shown were representative for three independent experiments. 

Figs. 23A-23B show the distribution of GFP-positive cells in a subpopulation of human 
CD34+ cells, expressing CD34 and CD117 (c-kit). (A) Co-localization of GFP 
expression with CD34 or CD117: CD34+ cells were infected with Ad5GFP or 
Ad5GFP/F35 at an MOI of 200 pfu per cell under the conditions described for Fig. 8. 
Twenty-four hours after infection, cells were incubated with anti-CD34 PE-conjugated 
MAbs (final dilution 1:2) or with anti-CD117 PE-conjugated MAbs (final dilution 1:5) 
for 30 min on ice, and 10^4 cells per variant were subjected to two-color flow cytometry 
analysis. For negative control staining, no antibodies were added to the cells before 
analysis. The mock infection variants represent cells incubated with virus dilution buffer 
only. The quadrant borders were set based on the background signals obtained with both 
the GFP- and PE-matched negative controls. The percentages of stained cells found in 
each quadrant are indicated. The experiment was performed two times in triplicate, and 
typically obtained results are shown. The SEM was less than 10% of the statistical 
average. (B) Transduction of CD34+/CD117+ cells with Ad5GFP and chimeric 
Ad5GFP/F35 virus vectors: CD34+ cells, cultured overnight before staining in media 
without SCF, were incubated with PE-labeled anti-CD117 MAb for 30 min on ice. The 
fraction of CD117-positive cells was sorted by FACS. More than 97% of sorted cells 
were positive for CD117. 1x10^5 CD117+/CD34+ cells were infected with Ad5GFP or 
Ad5GFP/F35 at an MOI of 200 pfu per cell, as for Fig. 8. Twenty-four hours post 
infection, the percentage of GFP positive cells was determined by flow cytometry. For mock 
infection, CD117+/CD34+ cells were incubated with virus dilution buffer only. The 
infections were done in triplicate, and the average percentage of GFP-expressing cells is 
indicated on the corresponding histogram. The SEM was less than 10% of the statistical 
average. 

Fig. 24 shows the Southern analysis of viral genomes in GFP-positive and GFP-negative
fractions of CD34+ cells infected with the Ad5GFP and chimeric Ad5GFP/F35 vectors.
CD34+ cells were infected with viruses at an MOI of 100 as described for Fig. 21.
Twenty-four hours post infection, cells were sorted by FACS into GFP-positive and
GFP-negative fractions. 10^5 cells from each fraction were used to isolate genomic DNA
together with viral DNA. Before cell lysis, a rigorous treatment with trypsin and DNase
followed by washing was performed to ensure that genomic DNA samples were not
contaminated by extracellular viral DNA. (A) The upper panel shows the ethidium
bromide stained 1% agarose gel before blotting, demonstrating that similar amounts of
genomic DNA were loaded. This amount corresponded to DNA isolated from ~25,000 GFP+
or GFP- cells. The lane labeled "load" represents viral DNA purified from Ad5GFP or
Ad5GFP/F35 virions, mixed with pBluescript plasmid DNA (Stratagene) as a carrier and
applied to the gel at the amount that was actually used to infect 25,000 cells. As a
concentration standard, a serial dilution of Ad5GFP genomes was loaded on the gel (left
side). For Southern analysis (lower panel), an 8 kb-long HindIII fragment corresponding
to the E2 region of Ad5 was used as a labeled probe. Hybridized filters were subjected
to PhosphorImager analysis and then exposed to Kodak X-OMAT film for 48 h at -70°C.
The cellular/viral genomic DNA is indicated by an arrow. (B) To detect Ad5GFP
genomes in transduced cells, PCR amplification followed by Southern blot hybridization
was performed on the same samples that were used for quantitative Southern blot
hybridization in (A). DNA purified from ~2,500 cells was subjected to PCR (95°C for
1 min, 53°C for 1 min, 72°C for 1 min; 20 cycles with primers Ad5-F1 and Ad5-R1). One
fifth of the PCR reaction was subjected to agarose gel electrophoresis (upper panel). A
0.9 kb-long DNA fragment specific to the E4 region of Ad5 was detected for transduced
Ad5GFP/F35 genomes. DNA was then blotted onto Hybond-N+ membrane, and Southern
blot hybridization (lower panel) with an Ad5 E4-specific DNA probe was performed. In
addition to the 0.9 kb DNA fragment, the PCR primers generated a smaller 0.5 kb-long
fragment that also hybridized with the E4 region probe.
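
As a rough illustration of the arithmetic behind the loading control in Fig. 24, the
following Python sketch computes the infectious dose corresponding to a given number of
cells and MOI. The function name is hypothetical and is not part of the disclosed methods;
the sketch assumes that the MOI is expressed in pfu per cell, as in the legends above. The
number of viral genomes in such a dose may be higher than the pfu count, since not every
virus particle is infectious.

```python
def pfu_required(cell_count: int, moi: float) -> float:
    """Return the plaque-forming units (pfu) needed to infect
    `cell_count` cells at a multiplicity of infection `moi` (pfu per cell)."""
    if cell_count < 0 or moi < 0:
        raise ValueError("cell_count and moi must be non-negative")
    return cell_count * moi


# Dose corresponding to 25,000 cells infected at an MOI of 100 (values from Fig. 24)
load_lane_pfu = pfu_required(25_000, 100)
print(f"{load_lane_pfu:.2e} pfu")
```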

Fig. 25 shows the role of fiber shaft length in Ad infection strategies. CAR-binding
variants (Ad5 and Ad9) and Ad35, which interacts with a non-CAR receptor, were analyzed
on CAR-expressing cells (293, Y79) and on K562 cells, which do not express significant
amounts of CAR. All vectors contain a GFP expression cassette packaged into an Ad5
capsid with modified fibers.

Fig. 26 shows the tertiary structure of the Ad5 knob: localization of CAR binding sites,
H-I loop and G-H loop.

Fig. 27 shows the substitution of the G-H loop with heterologous peptides. 

Fig. 28 shows the attachment and internalization of metabolically labeled serotypes with
human cell lines.
within a host cell. Accordingly, the transgene sequence may be a replicon sequence that
directs replication of the vector within the host cell, resulting in an autonomously replicating 
vector. Alternatively, the transgene sequence may permit vector replication that is 
dependent upon the host cell's replication machinery. 

The base vector sequences may be operatively linked to a transgene sequence that
encodes a gene product, such as a polypeptide, rRNA, or tRNA. For example, the
transgene may encode a polypeptide such as a viral capsid protein or a viral fiber protein.
The transgene may be derived from the same or a different serotype as the base vector
sequence.

Another example of a transgene is a reporter gene that encodes a gene product that
can be used as a selectable marker, such as drug resistance or a colorimetric marker. The
reporter gene may encode a gene product which can be readily detected by, for example,
a visual, microscopic, immunochemical, or enzymatic assay. The preferred reporter gene
encodes a gene product that can be detected by a non-destructive method, i.e., one that
does not destroy the cell that expresses the reporter gene.

A therapeutic gene is another example of a transgene. A therapeutic gene encodes a gene
product (e.g., polypeptide or RNA) which, when expressed in a host cell, provides a
therapeutic benefit or desired function to the host cell or to the tissue, organ, or
organism containing the host cell. The therapeutic benefit may result from modifying a
function of a gene in the host genome or from the additional function provided by the
therapeutic protein, polypeptide or RNA.

The base vector sequence may be linked to a transgene sequence that is a regulatory
element, such as a promoter, enhancer, transcription termination signal, or
polyadenylation sequence. The regulatory element may direct expression of the
transgene sequence that encodes a gene product by directing transcription or translation.
The regulatory element may regulate the amount or timing of expression of the transgene
sequence. The regulatory element may direct expression of the transgene in certain host
cells or tissues (e.g., host-specific or tissue-specific expression).

The base vector sequence may be linked to a transgene sequence that permits the vector to
integrate into another nucleotide sequence. The integration sequence may direct
integration of the whole vector or portions of the vector. The integration sequence may
or may not be related to the base vector sequence. For example, the integration and base
vector sequences may be from the same or different viral serotypes. The integration
sequence may be inverted terminal repeat sequences (ITRs) from adenovirus (Ad),
adeno-associated virus (AAV), or HIV.

The base vector sequence may be linked to a transgene sequence that directs homologous 
recombination of the vector into the genome of a host cell. Such transgene sequences 
may or may not be from the same viral serotype as the base vector sequence. 



The vector may be used to transport the heterologous sequence into a host cell or into a 
host cell's genome. 

The vector may comprise multiple endonuclease restriction sites that enable convenient
insertion of exogenous DNA sequences.

The term "hybrid vector" as used in the invention refers to a vector which comprises a 
nucleic acid sequence combined from two different viruses (e.g. Adenovirus and AAV). 

"Chimeric vector" refers to a vector which contains nucleic acid sequences that are
unnatural to the base vector (i.e., sequences not occurring naturally or sequences not in
their natural background, including heterologous sequences). A chimeric vector as used
in the invention may also be a hybrid vector. An example of a chimeric vector is
Ad.AAV expressing a modified fiber protein on its capsid.
The term "transduction" or "infection" refers to a method of introducing viral DNA
within a virus particle into a host cell. The viral DNA herein is in the form of 
recombinant virus, which is generated by linking a segment of DNA of interest into the 
viral genome in such a way that the gene can be expressed as a functional protein. 

The term "transfection" refers to a method of introducing a DNA fragment into a host 
cell. 

The term "heterologous" as used herein means that a nucleic acid or peptide sequence is
placed in a context that is not endogenous to the base adenovirus vector or to a
transduced cell. For example, a peptide sequence can be transferred from one protein to
another protein; the resulting protein is referred to herein as a heterologous protein. A
chimeric fiber protein (e.g., a serotype 5 tail domain and a serotype 35 shaft and knob
domain) is considered "heterologous" to the Ad5 vector. The term also includes
nucleic acids (e.g., coding sequences) from one strain or serotype of adenovirus
introduced into a different strain or serotype of adenovirus.

The term "regulatory elements" is intended to include promoters, enhancers, transcription
termination signals, polyadenylation sequences, and other expression control sequences.
Regulatory elements referred to in the invention include, but are not limited to, those
which direct expression of a nucleic acid sequence only in certain host cells (e.g.,
tissue-specific regulatory sequences).

The term "operably linked" indicates that a polynucleotide sequence (e.g., a coding
sequence or gene) is linked to a regulatory element in such a way that the regulatory
element sequence controls and regulates the transcription or translation, or both, of that
polynucleotide sequence. The orientation of the regulatory element may vary (e.g., be in
reverse orientation with respect to the right ITR). The term also includes having an
appropriate start signal (e.g., ATG) in front of the polynucleotide sequence to be
expressed and maintaining the correct reading frame to permit expression of the
polynucleotide sequence under the control of the expression control sequence and
production of the desired polypeptide or protein. Regulatory sequences can also include
3' sequences which ensure correct termination (e.g., a polyadenylation stop signal).

The term "gene therapy" as used herein refers to a method which introduces a segment of
exogenous nucleic acid into a cell in such a way that it results in functional modification
of the recipient cell by expression of the exogenous nucleic acid. The exogenous nucleic
acid is typically therapeutic in that the expression of the encoded protein, polypeptide or
RNA corrects cellular dysfunction due to a genetic error or more generally counteracts
any undesirable functions which are associated with a genetic or acquired disease. The
term "exogenous nucleic acid" refers to DNA or RNA sequences not normally expressed
in the treated transformed cell. The term also refers to DNA and RNA sequences which
are expressed in a treated transformed cell at a higher or lower level, or in an otherwise
different pattern, than in the untreated, nontransformed cell. This non-natural expression
can also be termed heterologous expression.

A "gene therapy vector" refers to a vector used for gene therapy, i.e. to introduce the 
exogenous nucleic acid into a recipient or host cell. The exogenous nucleic acid may be 
transiently expressed or integrated and stably expressed in the recipient or host cell. 

The term "plasmid" as used herein refers to any nucleic acid molecule which replicates
independently of the host, maintains a high copy number, and which can be used as a
cloning tool.

The terms "parallel strand of DNA" and "anti-parallel strand of DNA" refer to the two
strands of the double-stranded adenovirus genome. The Figures diagram the location
of certain nucleotides on the parallel strand of DNA. The anti-parallel strand of DNA
refers to the other of the two strands, which is not depicted in the Figures. The
fiber protein is encoded on the anti-parallel strand of DNA. To simplify the vector
diagrams, the fiber sequences are shown on the parallel strand even though the gene is
located on the anti-parallel strand.
The term "reporter gene" refers to any nucleic acid sequence which encodes a
polypeptide or protein which can be readily detected by, for example, a visual,
microscopic, immunochemical or enzymatic assay. Preferred reporter genes are those
that can be detected by a non-destructive method that does not destroy the treated,
transformed cells or tissue.

The term "selection gene" as used herein refers to any nucleic acid fragment which encodes
a polypeptide or protein whose expression is used to mark a cell as having been
transformed by a given vector.

The term "therapeutic gene" refers herein to a DNA fragment encoding a functional
polypeptide, protein or RNA, which when expressed in a host cell provides a therapeutic
benefit or desired function to the host cell or to the organ or organism containing the host
cell. The therapeutic benefit may result from modification of a function of a native gene
in a host or from the additional function provided by the therapeutic protein, polypeptide
or RNA.

The term "host tissue" or "host cell" as used herein refers to a tissue or cell in which a
therapeutic gene is to be expressed to modify its function.

It is well known in the biological arts that certain amino acid substitutions may be made
in protein sequences without affecting the function of the protein. Generally,
conservative amino acid substitutions or substitutions of similar amino acids are tolerated
without affecting protein function. Similar amino acids can be those that are similar in
size and/or charge properties; for example, aspartate and glutamate, and isoleucine and
valine, are both pairs of similar amino acids. Similarity between amino acid pairs has
been assessed in the art in a number of ways. For example, Dayhoff et al. (1978) in Atlas
of Protein Sequence and Structure, Volume 5, Supplement 3, Chapter 22, pp. 345-352,
which is incorporated by reference herein, provides frequency tables for amino acid
substitutions which can be employed as a measure of amino acid similarity. Dayhoff et
al.'s frequency tables are based on comparisons of amino acid sequences for proteins
having the same function from a variety of evolutionarily different sources. Therefore,
any obvious changes in the amino acid sequences (as described above) to the sequences
of the invention are already contemplated.

Polypeptides which are "substantially similar" share sequences as noted above except that
residue positions which are not identical may differ by conservative amino acid changes.
Conservative amino acid substitutions refer to the interchangeability of residues having
similar side chains. For example, a group of amino acids having aliphatic side chains is
glycine, alanine, valine, leucine, and isoleucine; a group of
amino acids having aliphatic-hydroxyl side chains is serine and threonine; a group of
amino acids having amide-containing side chains is asparagine and glutamine; a group of
amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a
group of amino acids having basic side chains is lysine, arginine, and histidine; and a group
of amino acids having sulfur-containing side chains is cysteine and methionine.
Preferred conservative amino acid substitution groups include, but are not limited to:
valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine,
asparagine-glutamine, and aspartate-glutamate. Therefore, polypeptide substitutions
yielding "substantially similar" sequences (as described above) to the amino acid sequences
described in the invention are already contemplated.
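
The preferred substitution groups listed above can be expressed as a simple lookup. The
following Python sketch is illustrative only: the function names are hypothetical, the
one-letter residue codes are standard, and the comparison assumes two sequences of equal
length that are already aligned position by position. The text does not define a numerical
threshold for "substantially similar", so the sketch simply requires that every differing
position be a substitution within one of the preferred groups.

```python
# Preferred conservative substitution groups, as listed in the text above
PREFERRED_GROUPS = [
    {"V", "L", "I"},  # valine-leucine-isoleucine
    {"F", "Y"},       # phenylalanine-tyrosine
    {"K", "R"},       # lysine-arginine
    {"A", "V"},       # alanine-valine
    {"N", "Q"},       # asparagine-glutamine
    {"D", "E"},       # aspartate-glutamate
]


def is_conservative(a: str, b: str) -> bool:
    """True if residues a and b are identical or share a preferred group."""
    a, b = a.upper(), b.upper()
    return a == b or any(a in group and b in group for group in PREFERRED_GROUPS)


def substantially_similar(seq1: str, seq2: str) -> bool:
    """True if two pre-aligned, equal-length sequences differ only by
    substitutions within the preferred groups (simplifying assumption)."""
    if len(seq1) != len(seq2):
        return False
    return all(is_conservative(x, y) for x, y in zip(seq1, seq2))


print(substantially_similar("KVD", "RIE"))  # K->R, V->I, D->E
print(substantially_similar("KVD", "KVG"))  # D->G is not in any listed group
```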

In order that the invention herein described may be more fully understood, the following 
description is set forth. 

The present invention provides unique gene transfer vehicles which overcome many of
the limitations of prior art vectors. The invention describes first generation adenovirus
vectors comprising left and right Ad ITRs, an Ad packaging sequence, a transgene
cassette with regulatory elements, and a pair of cassette ITRs flanking the transgene
cassette that direct predictable viral genomic rearrangements during viral replication as
well as direct the integration of the transgene cassette into the host cell genome.
One predictable rearrangement that occurs during viral replication is the generation of a
gutless adenovirus vector (also referred to herein as ΔAd) that comprises right and left Ad
ITRs, an Ad packaging sequence, and a transgene cassette flanked by cassette ITRs, and
that is devoid of all other immunogenic viral genes.

The potential for site-specific integration is an important characteristic of the novel Ad
vectors of the invention. In an embodiment of the invention, integration of the transgene
cassette is directed by co-infection with an Ad vector expressing, e.g., the rep78 protein
to achieve site-specific integration at, e.g., the AAVS1 site on human chromosome 19.

The invention further describes a novel way of targeting these recombinant adenovirus
vectors to selected cells by modifying the adenovirus fiber protein that is expressed on
the capsid. Changes to both the fiber shaft and the fiber knob domain were shown to
successfully retarget the Ad vector to a desired cell type. In addition, the G-H loop within
the fiber knob domain is identified as a novel site that affects the binding affinity and
specificity of the recombinant adenovirus vector. Substitution of peptide sequences into
the G-H loop retargets the gutless vector to a desired cell type.

An adenovirus display library has been generated that expresses random peptides within
the G-H loop of the fiber protein. This type of library serves as a source of ligands to
screen for adenovirus vectors that bind to desired cell types. One advantage of using an
adenovirus display library versus a phage display library is that, once adenovirus affinity
for a desired cell is identified, the targeted adenovirus vector is ready to accept a
transgene cassette and can be used to generate a gutless adenovirus vector, for example
for use in gene therapy.

The chimeric Ad vectors described below contain a modified fiber protein in the capsid
of the adenovirus which renders the vector capable of infecting a desired cell type.
Therefore, according to the invention, a gutless chimeric ΔAd.AAV vector can be
generated to introduce any transgene(s) into any host cell or tissue which is normally
refractory to most commonly used gene therapy viral vectors. In addition, the chimeric
ΔAd.AAV vector of the invention is devoid of adenoviral genes and contains AAV ITR
sequences that flank the transgene cassette, which direct stable transgene integration into
the host genome, allowing long term expression of the transgene.



The transgene cassette described in the invention may carry a transgene which is either a
reporter gene, a selectable gene for in vitro or in vivo selection of transduced cells, or a
therapeutic gene. In one embodiment of the invention, the reporter transgene can be, but
is not limited to, β-galactosidase. Many reporter genes are commonly used in the art, any
of which could be carried as a transgene in the Ad.AAV vector of the invention. Other
examples of reporter genes are GFP and alkaline phosphatase.

The following describes (a) an embodiment of the first generation Ad vectors of the
invention having a wild-type capsid and a transgene cassette flanked by cassette ITR
sequences; (b) a fiber protein that is modified to retarget Ad vectors; and (c) the
combination of both technologies, which enables the production of a chimeric ΔAd vector
including a modified fiber protein expressed on the capsid which retargets the base vector
to a desired cell type for infection and transgene integration.

A. Integrating Ad hybrid vectors of the invention: 

It has been shown that inverted repeats (IRs) inserted into the E1 region of AdE1- vectors
can mediate predictable genomic rearrangements resulting in a gutless vector genome
devoid of all viral genes. A specific embodiment of such IR-mediated rearrangements is
the Adeno-AAV, a first generation hybrid adenovirus vector containing AAV inverted
terminal repeats (ITRs) flanking a transgene cassette. The AAV ITRs mediate the
formation of a genome similar to that of the ΔAd.IR genome (Steinwaerder et al., 2000
Journal of Virology). ΔAd vectors devoid of all viral genes stably integrate and transduce
cultured cells with efficiencies comparable to, e.g., rAAV vectors. The Examples
demonstrate by Southern blot analysis that the ΔAd vectors integrate randomly into the
host genome.

The Ad vectors of the invention comprise a left Ad ITR; an adenovirus packaging
sequence located 3' to the Ad ITR; and a transgene cassette located 3' to the packaging
sequence, comprising a polyadenylation signal, a transgene, and a heterologous promoter,
and flanked by a pair of cassette ITRs. Adenoviral genes used for replication, such as E1,
E2, E3, and E4, are located 3' to the right cassette ITR, and a right Ad ITR is located 3'
to the replication genes. The vectors of the invention are particularly suited to treat
genetic disorders, cancers, and infectious diseases (such as HIV, emboli, or malaria).
Genetic diseases such as hemophilia A and B, cystic fibrosis, muscular dystrophy, and
α1-antitrypsin disorder are ideal candidates for treatment with
vectors of the invention. A specific example of a therapeutic gene to combat a genetic
disorder is gamma-globin to ameliorate sickle cell anemia.
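
The element order described above for the first generation vector, and the composition
of the gutless vector derived from it, can be summarized in a small data structure. The
following Python sketch is a bookkeeping aid only; the type and function names are
hypothetical. It lists which elements are retained in the gutless vector according to the
text and makes no attempt to model the actual order or orientation of elements in the
rearranged genome.

```python
from typing import List, NamedTuple


class Element(NamedTuple):
    name: str
    viral_gene: bool  # True for adenoviral genes that are absent from the gutless vector


# 5' to 3' order of the first generation vector, as described in the text
FIRST_GENERATION_VECTOR: List[Element] = [
    Element("left Ad ITR", False),
    Element("Ad packaging sequence", False),
    Element("left cassette ITR", False),
    Element("transgene cassette", False),
    Element("right cassette ITR", False),
    Element("Ad replication genes", True),
    Element("right Ad ITR", False),
]


def gutless_composition(vector: List[Element]) -> List[str]:
    """Return the names of the elements retained once all viral genes are lost."""
    return [element.name for element in vector if not element.viral_gene]


print(gutless_composition(FIRST_GENERATION_VECTOR))
```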

To aid in the selection of transduced cells and to characterize the integration site of the
transgene cassette, an embodiment of the invention includes the addition of a sequence
comprising a bacterial origin of replication plus a selectable gene. An
embodiment of this is an SNori sequence added to the transgene cassette. This allows the
ΔAd to be expressed in human and bacterial cells, thereby allowing selection of the
transduced cells and characterization of the integration site in the genome of transduced
mammalian cells.

The potential for site-specific integration is an important characteristic of the novel Ad
vectors of the invention. In an embodiment of the invention, integration of the ΔAd.AAV
is directed by co-infection with Ad AAV expressing the rep78 protein in 293 cells to
achieve site-specific integration in the AAVS1 site on human chromosome 19. For this
type of site-specific integration to occur in cells other than 293 cells, E4 ORF6
expression is required. The co-infection of ΔAd.AAV, ΔAd.rep78, and ΔAd.E4-orf6
allows for site-specific integration of the ΔAd.AAV transgene cassette. The ΔAd.rep78
and the ΔAd.E4-orf6 genomes are degraded soon after transduction, thus avoiding
potential side effects. Site-specific integration is preferred over random integration,
which is seen with rAAV and ΔAd.AAV, in order to reduce the risk of insertional
mutagenesis.
Integration of the transgene cassette contained in the adenoviral vectors into
chromosomes may be associated with silencing (or blocking) of transgene expression.
The silencing of transgenes can be overcome by adding insulator elements to the
transgene cassette. For example, HS-4 insulator elements derived from the chicken-globin
LCR can function in Ad vectors to shield heterologous promoters from adenoviral
enhancers. HS-4 insulators or the Drosophila Gypsy gene can also be used to prevent
silencing of transgenes.

Another embodiment of the invention is to split the transgene cassette into two portions,
each carried in a different recombinant adenoviral vector of the
invention. The two portions of the same transgene share an overlapping region of homology.
After infection with both vectors, each carrying a different but overlapping portion of
the same transgene, a homologous recombination event occurs, resulting in the
reconstitution of the complete transgene, which is then expressed. This technique is used
to produce hybrid adenoviral vectors that accommodate large inserts including, but not
limited to, a 13 kb genomic hAAT gene or a 12 kb γ-globin LCR γ-globin expression
cassette for ameliorating sickle cell anemia (or correcting γ-globin mutations). The
formation of the hybrid ΔAd vector genomes, after recombination between two vectors, is
more efficient if the overlapping region of homology within the transgene is longer.
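
The idea of reconstituting a complete transgene from two overlapping portions can be
illustrated at the sequence level. The following Python sketch is a string-level analogy
only and does not model the cellular recombination process or its efficiency; the function
name and the toy sequences are hypothetical. It joins two fragments at the longest region
in which the end of the first fragment matches the start of the second.

```python
def reconstitute(left: str, right: str, min_overlap: int = 1) -> str:
    """Join two fragments at their shared region of homology, defined here as the
    longest suffix of `left` that is also a prefix of `right`."""
    if min_overlap < 1:
        raise ValueError("min_overlap must be at least 1")
    longest_possible = min(len(left), len(right))
    # Try the longest candidate overlap first, then progressively shorter ones
    for k in range(longest_possible, min_overlap - 1, -1):
        if left[-k:] == right[:k]:
            return left + right[k:]
    raise ValueError("no overlapping region of homology found")


# Toy fragments sharing the 4-nucleotide overlap "CATT" (hypothetical sequences)
print(reconstitute("ATGGCCATT", "CATTGGA"))
```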



An advantage of the present invention is a method to rapidly isolate pure gutless hybrid
adenoviral vectors such as ΔAd.AAV or ΔAd.AAVfx vectors. A strategy to minimize the
contamination of ΔAd with first generation vectors (Ad vectors) is described in
Example I H. It is anticipated that these approaches will yield the same titer of ΔAd
vectors; however, the contamination with full-length genome vectors will be lower. This
improved isolation of the vectors is extremely important to avoid toxic side effects after
in vivo application.

B. Tropism modified adenovirus vectors: 
The Ad vectors of the invention can be modified so that they target a host cell of interest.
There are more than 50 human Ad serotypes (Appendix I), including variants with
different tissue selectivity or tropism. It is accepted in the art that different Ad serotypes
bind to different cellular receptors and use different entry mechanisms. Most recombinant
adenovirus vectors use adenovirus serotype 5 (Ad5) as the base vector (Hitt,
M.M., et al., 1997, Adv. in Pharmacology 40, 137-205). Ad5 infection is primarily
mediated by its fiber protein binding to CAR and secondarily by its penton base protein
binding to integrin. Due to the lack of CAR and/or integrin expression on many cell and
tissue types, Ad5 mediated gene transfer is inefficient in a number of tissues which are
important targets for gene therapy, such as endothelia, smooth muscle, skin epithelia,
differentiated airway epithelia, brain tissue, peripheral blood cells, or bone marrow. The
following describes Ad5 vectors of the invention having a change in infectivity and
tropism as a result of altering the fiber protein sequence.

The infectivity of different Ad serotypes is limited to a number of human cell lines.
Infectivity studies revealed that Ad5 and Ad3 are particularly suitable for infecting and
targeting endothelial or lymphoid cells, whereas Ad9, Ad11 and Ad35 efficiently infected
human bone marrow cells. Therefore, the knob domains of the fiber proteins of Ad9, Ad11
and Ad35 are excellent candidates for retargeting the Ad5 vector to human bone marrow
cells. Other possible serotypes include Ad7.

In the modified fiber protein of the invention, the fiber knob domain of the Ad5 fiber has
been replaced with another Ad serotype fiber knob domain. An embodiment of the
invention is the modified Ad5/35 fiber protein (a recombinant Ad5 vector expressing a
modified fiber protein comprising a fiber tail domain of Ad5 and the fiber shaft and
knob domains of Ad35). The Ad5/35 chimeric fiber protein shows a broader spectrum of
infection to a subset of CD34+ cells, including those with stem cell activity. The Ad5/11
chimeric fiber protein (a recombinant Ad5 vector expressing a modified fiber protein
comprising the fiber tail domain of Ad5 and the fiber shaft and knob domains of Ad11)
showed similar tropism.
In addition to the knob domain modifications, the invention describes the added
advantage of modifying both the fiber shaft domain and the fiber knob domain to produce
a shortened fiber protein. The length of the fiber shaft domain plays a key role in
determining the host receptors used for viral vector entry into the host cell. To show this,
Ad5, Ad5/9, and Ad5/35 variants were constructed with long-shafted (22 β-sheets) and
short-shafted (7 β-sheets) fibers. These analyses demonstrated that efficient viral
infection involving CAR as the primary receptor for Ad5 and Ad5/9 requires a long-shafted
fiber protein, whereas the cell entry strategy of Ad5/35 (which binds to a still
uncharacterized non-CAR receptor) does not depend on the shaft length (Fig. 3). The
modification of both the fiber shaft domain length (between 5 and 10 β-sheets) and the
fiber knob domain (from a different Ad serotype than the base vector) is a novel mode of
altering Ad vector tropism.

To broaden the repertoire of cell types that Ad vectors can infect, a specific binding
region, the G-H loop, within the knob domain has been newly identified herein to
improve binding affinity and specificity. Alteration within this region will redirect the Ad
vector to a desired cell type. For example, the invention describes the G-H loop sequence
within the fiber protein knob domain, which can be replaced with heterologous peptide
ligand sequences without affecting the functionally important tertiary structure of the Ad
fiber knob domain, while changing the binding affinity and specificity of the vector (Figs.
6, 7). This G-H loop region is exposed on the central part of the knob surface and may be
strategically a better site for incorporation of heterologous ligands than the peripheral H-I
loop (Krasnykh, V. et al., 1998, J. Virol., 72:1844-52) or the knob C-terminus (Michael,
S. I., et al., 1995, Gene Ther., 2:660-8; Wickham, T. J. et al., 1996, Nat. Biotechnol.,
14:1570-3), which are the substitution sites used by others. Therefore, these G-H loop
modifications within the fiber knob domain will allow the Ad vector to be redirected to
infect a desired cell type, as long as the G-H loop ligand sequence binds to at least one
surface protein on the desired cell type. Fig. 7 shows some possible substitutions.
Example II J demonstrates that the virion tolerates the insertion of a cyclic peptide (12
amino acids) with a constrained secondary structure that allows its exposure on the knob
surface. A defined ligand (RGD) can be inserted into the G-H and the H-I loop of an
Ad5 capsid that is ablated for CAR and integrin tropism. Infectivity studies show the
potential advantage of this new insertion site.



Use of the Vectors of the Invention for "Gene Therapy" 

The liver is the major organ for protein synthesis. Therefore, an important goal of gene
therapy is to target gene therapy vectors to the liver. To genetically correct many types of
mutant proteins, hepatocytes need to be infected with gene therapy vectors carrying a
corrected transgene. Example II J describes a G-H loop substitution in the knob domain
of the fiber protein with both RI and RIR (of the malaria circumsporozoite surface
protein) in a short-shafted fiber protein, which gives the vector affinity and
specificity for hepatocytes.

Example II K applies a similar protocol to alter the fiber knob domain in the G-H loop
region with peptides that target the vector to breast cancer cell lines (MDA-MB-435).
These novel approaches to redirect vectors described in the invention allow lower doses
of the gene therapy vectors to be administered with a higher safety profile.

In Example II L, a protocol for preparing an adenovirus display library is described that
uses the fiber knob protein to display a library of random peptide sequences within the
G-H loop. This library of adenoviruses with modified fiber proteins is screened for affinity
and specificity for a desired cell type. There are two main advantages of using this
adenovirus display library, rather than a phage display library system, to screen for
target peptides that allow binding to a desired cell type. First, once a ligand peptide is
identified that binds to the desired cell type, it is already in the vector of choice for gene
therapy delivery. The peptide does not need to be engineered into another vector, as is
the case for the phage display library vectors. This reduces the steps required to identify
a targeted fiber protein for a desired cell type. The second advantage of this method is
that the adenoviruses are able to display multiple copies of the modified fiber protein on
their capsid. This allows for dimerization and trimerization of the fiber protein with the
host cell receptor. The multimerization of fiber proteins reflects the realistic, in vivo
interaction of the trimeric fiber protein with the host cell receptor. In contrast, phage
vectors can only display one fiber peptide sequence on their surface, which significantly
limits their ability to interact with host cell surface receptors.
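
The concept of a library of knob variants carrying random peptides in place of the G-H
loop can be sketched at the amino acid level. The following Python example is illustrative
only: the function name, the placeholder knob sequence, and the loop coordinates are
hypothetical and do not correspond to the actual Ad5 knob sequence, and the default peptide
length of 12 is an assumption. In practice such a library would be constructed at the
DNA level as described in Example II L.

```python
import random
from typing import List

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def build_display_library(knob: str, loop_start: int, loop_end: int,
                          n_variants: int, peptide_len: int = 12,
                          seed: int = 0) -> List[str]:
    """Replace knob[loop_start:loop_end] with a random peptide in each variant,
    leaving the flanking sequence unchanged."""
    if not 0 <= loop_start <= loop_end <= len(knob):
        raise ValueError("loop coordinates must lie within the knob sequence")
    rng = random.Random(seed)  # seeded so that the toy library is reproducible
    library = []
    for _ in range(n_variants):
        peptide = "".join(rng.choice(AMINO_ACIDS) for _ in range(peptide_len))
        library.append(knob[:loop_start] + peptide + knob[loop_end:])
    return library


# Placeholder sequence; residues 6-10 stand in for the loop to be replaced
TOY_KNOB = "MKTAYSGGSGGLVPRW"
variants = build_display_library(TOY_KNOB, loop_start=6, loop_end=11, n_variants=5)
for variant in variants:
    print(variant)
```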

C. A chimeric adenovirus vector with selective tropism:

The chimeric vectors of the invention combine two vectors: an Ad.ITR and an Ad.fx,
where fx describes a modified fiber protein. A first generation adenovirus vector of
serotype 5 is the base vector that carries a transgene cassette flanked by heterologous
ITRs. These specific inverted terminal repeat sequences, such as AAV ITRs, direct stable
integration of the transgene cassette into the host genome as well as control predictable
genomic rearrangements that occur during viral replication. This vector can also carry a
modified fiber gene (described in Example II). During replication, predictable genomic
rearrangements occur which result in the generation of a gutless adenovirus vector (e.g.,
ΔAd.AAVfx) which expresses the modified fiber protein on its capsid. The modified fiber
protein allows the gutless vector to be targeted to a selected cell type. The targeted
vector is a gutless adenovirus vector devoid of adenoviral genes which can integrate its
transgene into the host genome. The transgene cassette can carry reporter, selectable, or
therapeutic genes.



In one embodiment of the invention, the gutless targeted ΔAd.AAVfx carries the reporter
gene β-galactosidase (ΔAd.AAVfx-BG). For easy in vitro selection of human and
bacterial cells that are transduced with the hybrid Ad vector, a bacterial origin of
replication can be added to the hybrid Ad vectors. An example of this is
ΔAd.AAVfx-SNori, in which an SNori sequence is added into the transgene cassette. This
sequence allows for G418 selection of cells infected with ΔAd.AAVfx-SNori. This in vitro
selection provides a tool to analyze the site of transgene integration and the flanking
chromosomal regions. Fluorescent in situ hybridization (FISH) is an alternative method
to confirm vector integration.

An advantage of the ΔAd chimeric vector for gene transfer is the efficient and stable
integration of a large transgene cassette of up to, e.g., 22kb, which is significantly larger
than the capacity of retroviral vectors. This is of particular interest for gene therapy. For
example, to ameliorate sickle cell anemia, ΔAd.AAVfx-γ-globin, a targeted, integrating vector
carrying a gamma-globin expression cassette, can be introduced into bone
marrow stem cells for long term expression of the gamma-globin gene.

To achieve site-specific gene integration, Rep78 protein is used to direct transgene
integration into the AAVS1 site (described in Example I, section D). However, integration at
this site may silence transgene expression. To prevent the integrated transgene from being
silenced by host genomic elements (such as positional effects or downstream enhancers),
LCRs or insulator elements are incorporated into the transgene cassette.

The following examples are presented to illustrate the present invention and to assist one of 
ordinary skill in making and using the same. The examples are not intended in any way to 
otherwise limit the scope of the invention. 

EXAMPLE I 

NOVEL ADENOVIRAL VECTOR Ad.AAV 

A. Integrating ΔAd.AAV hybrid vectors devoid of all adenoviral genes.

In vitro and in vivo studies with rAAV indicate that the only requirements for rAAV
integration are the AAV ITRs and as yet unknown host cellular factors. It is thought that
specific sequences or secondary structures present in AAV ITRs are prone to integration
into host chromosomal DNA. In order to combine the advantages of adenoviral vectors (high
titer, high infectivity, large capacity) with the integration capability of AAV ITRs, AAV
vector DNA consisting of AAV ITRs flanking a secreted human placental alkaline
phosphatase (SEAP)-neomycin phosphotransferase (neo) reporter gene cassette
(Alexander, I. E., et al. 1996, Human Gene Therapy, 7, 841-850) is incorporated into the E1
region of E1/E3-deleted adenoviral vectors (Ad.AAV1) (Figure 1, top).



METHODS 

Production/Characterization of Viral Vectors 
Plasmids:

The AAV1 vector cassette containing AAV ITRs and SEAP/neo expression units is
obtained by AseI/ScaI digestion of the plasmid pALSAPSN (Alexander, I. E. et al., 1996,
Human Gene Therapy, 7:841-50). The 4.4kb AAV vector fragment is cloned via NotI
adapter linkers into pXCJL1 (Microbix, Toronto, Canada) (pAd.AAV1). Another shuttle
vector (pAd.AAV1-Δ2ITRs) lacking the AAV ITRs is generated by inserting the 3.7kb
AflII/BsmI fragment of pALSAPSN into pXCJL1. For pAd.AAV1-Δ1ITR, a construct is
used in which a spontaneous deletion in the left AAV ITR between the A and A' regions
has occurred. To create a second hybrid vector (Ad.AAV2), the AAVSNori cassette
developed by E. Rutledge is used. AAV vector DNA is obtained from pASNori
(Rutledge, E. A., and Russell, D. W. 1997. Journal of Virology 71:8429-8436) as a 3.4kb
BsaI/ScaI fragment and inserted into the EcoRV site of pXCJL1. As is generally
known for AAV vector plasmids, the AAV ITRs are prone to rearrangements. To
minimize deletions in these functionally critical regions, all constructs for generation of
hybrid vectors are assembled in low copy-number plasmids, which are grown in E. coli
Top10, JC811, or XL1-Blue cells (Stratagene, La Jolla, Calif.). Furthermore, after each
cloning step or large-scale plasmid amplification, both AAV ITRs are carefully mapped
by restriction analysis with enzymes that cut inside or adjacent to the ITRs (BssHII,
AhdI, SmaI, BglI, BsmI, AflII, and ScaI).

First-generation viruses with the different transgene cassettes incorporated into the E1
region are generated by recombination of the pΔE1sp1A- or pXCJL1-derived shuttle
plasmids and pJM17 (Microbix) in 293 cells as described earlier (Lieber, A., et al., 1996,
J. of Virology, 70, 8782-8791). For each virus, at least 20 plaques are picked, amplified,
and analyzed by restriction digest. Viruses containing two AAV ITRs tend to rearrange
within the ITRs, with other adenoviral sequences, or with adenoviral sequences present in
the 293 cell genome. Only plaques from viruses with intact ITRs are amplified, CsCl
banded, and titered as described earlier (Kay, M. A., et al. 1995. Hepatology 21:815-819;
Lieber, A., et al. 1996. Journal of Virology 70:8944-8960). All virus preparations tested
are negative for RCA and bacterial endotoxin (Lieber, A., et al. 1997. Journal of Virology
71:8798-8807). Virus is stored at -80°C in 10mM Tris-Cl, pH 7.5, 1mM MgCl2, 10%
glycerol.

To generate ΔAd.AAV, 293 cells are infected with Ad.AAV1 at a multiplicity of
infection (MOI) of 25 and harvested 40h after infection. Cells are lysed in PBS by 4
cycles of freeze/thawing. Lysates are centrifuged to remove cell debris and digested for
30 min at 37°C with 500 units/ml DNase I and 200 µg/ml RNase A in the presence of
10mM MgCl2. 5ml of lysate is layered on a CsCl step gradient (0.5ml of 1.5g/cm^3, 2.5ml
of 1.35g/cm^3, 4ml of 1.25g/cm^3) and ultracentrifuged for 2h at 35,000 rpm (rotor SW41).
CsCl fractions are collected by puncturing the tube and are analyzed for viral DNA
(Lieber, A., et al. 1996. Journal of Virology 70:8944-8960; Steinwaerder, D. S., et al.
1999. J Virol 73:9303-13) or subjected to ultracentrifugation at 35,000 rpm for 18 hours
in an equilibrium gradient with 1.32 g/cm^3 CsCl. The band containing the deleted viruses
ΔAd.AAV is clearly separated (0.5cm distance) from other banded viral particles
containing full-length Ad.AAV genomes. ΔAd.AAV1 fractions are dialyzed against
10mM Tris-Cl, pH 7.5, 1mM MgCl2, 10% glycerol and stored at -80°C. The genome titer
of ΔAd.AAV1 preparations is determined by quantitative Southern analysis of viral
DNA purified from viral particles in comparison to different concentrations of a 4.4kb
AseI/ScaI fragment of pALSAPSN according to a protocol described earlier (Lieber, A.,
et al., 1996, J. of Virology, 70, 8782-8791). In total, the production of 1x10^13 genome-
particles of ΔAd.AAV1 requires less than 3 hours of actual work.



Titers routinely obtained are in the range of 3-8x10^12 genomes per ml. Assuming one
genome is packaged per capsid, the genome titer equals the particle titer. The level of
contaminating Ad.AAV1 is less than 0.1% as determined by Southern analysis, which is
consistent with results obtained by plaque assay on 293 cells (fewer than 5 plaques per
10^6 total genomes). The primers used for sequencing the left and right ITR-vector
junctions are

5'GGCGTTACTTAAGCTAGAGCTTATCTG, and
5'CTCTCTAGTTCTAGCCTCGATCTCAC.

The recombinant AAV virus stock containing the SEAP/neo cassette (AV2/ALSAPSN;
Alexander, I. E. et al., 1996, Human Gene Therapy, 7:841-50) used in these studies was
obtained from Dusty Miller (FHCRC, Seattle). The stock was free of contaminating
replication competent AAV (<50 particles/ml) and wildtype adenovirus (<100
particles/ml). The genome titer of the virus stock was obtained by quantitative Southern
blots as described by Russell et al. (Russell, D. et al. 1994 Proc. Natl. Acad. Sci. USA
91:8915-8919).

Electron Microscopy: 

For examination of viral particles in the transmission electron microscopy studies, CsCl-
purified virions are fixed with glutaraldehyde and stained with uranyl acetate as described
previously (Lieber, A., et al. 1996. Journal of Virology 70:8944-8960).

RESULTS 

During replication of these hybrid vectors in 293 cells, a 5.5kb genome (ΔAd.AAV1) is
efficiently generated and packaged into adenovirus (Ad5) capsids. The ΔAd.AAV1
genome contains the left adenovirus ITR and the packaging signal followed by the AAV
vector cassette and a duplicate of the adenoviral packaging signal and ITR in reverse
orientation (Figure 1, bottom). The hybrid vector is devoid of all viral genes, thus
eliminating toxic effects and the elicitation of cellular immune responses. The
spontaneous formation of the small hybrid vector genome ΔAd.AAV1 requires the
presence of two intact AAV ITRs and does not occur with partly deleted ITRs or oligo-
dC and oligo-dG stretches flanking the expression cassette.
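
The genome layout described above, in which the left Ad ITR and packaging signal reappear
in reverse orientation at the right end, can be illustrated with a minimal Python sketch.
The sequences are short hypothetical placeholders rather than real viral sequences, and the
function names are assumptions introduced here for illustration only.

```python
# Toy model of the deleted hybrid genome layout described in the text.
# All sequences are hypothetical placeholders, not real viral sequences.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def build_deleted_genome(left_end: str, cassette: str) -> str:
    """Left Ad ITR + packaging signal, then the AAV vector cassette, then a
    duplicate of the left end in reverse orientation."""
    return left_end + cassette + reverse_complement(left_end)


left_end = "ACGTTGCAAGGT" + "GGTTTTAGGC"  # placeholder Ad ITR + packaging signal
cassette = "TTGGCCACTCCC" + "ATGAAACCC" + "GGGAGTGGCCAA"  # placeholder AAV ITR, transgene, AAV ITR

genome = build_deleted_genome(left_end, cassette)

# The two termini are inverted repeats of each other, so reading either
# strand 5' to 3' starts with the same terminal sequence.
n = len(left_end)
assert genome[:n] == reverse_complement(genome)[:n]
print(len(genome), genome[:n])
```

Because both ends carry the same ITR and packaging signal in this toy layout, either end looks
identical to a factor that recognizes the terminus.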

Hybrid vectors containing different transgenes:

To construct a hybrid vector with a transgene that can be detected in situ, the SEAP/neo
expression unit in Ad.AAV1 is replaced by the E. coli β-galactosidase gene. This hybrid
vector is named ΔAd.AAVBG. During generation of the corresponding plasmid constructs,
the AAV ITR sequences tend to rearrange and lose their functional properties. This
problem can be circumvented by using low copy number plasmids as cloning vectors,
grown in bacterial strains depleted of all recombination proteins (e.g. JC811).
Furthermore, the intactness of both AAV ITRs after each cloning step can be examined
by characteristic endonuclease digestion. Recently, another hybrid vector,
ΔAd.AAVSNori, has been generated which contains the neo gene under the control of
both the simian virus 40 (SV40) early promoter and the transposon 5 (Tn5) promoter for
expression in human and bacterial cells, as well as the p15A bacterial replication origin
with the direction of the leading strand DNA synthesis opposite that of neo gene
transcription. Thus, SNori can be used for G418 selection of integrated vector in
eukaryotic cells as well as for rescue of vector together with flanking host DNA after
integration. The recovered plasmids can be propagated in E. coli under selection with
kanamycin due to the bacterial origin and the neo gene. SNori-containing vectors allow a
rapid estimation of total integration events based on the number of G418 resistant
colonies. Moreover, vector DNA together with flanking chromosomal DNA can be
rescued as plasmids from single G418 resistant clones and can be used for sequencing to
determine integration junctions. Both hybrid vectors are produced at a titer of about
3x10^12 genomes per ml. The ratio of genome titer to transducing particles for
ΔAd.AAVBG is ~200:1 based on β-Gal expression.

DISCUSSION 

ΔAd.AAV1 could form spontaneously during adenovirus replication. Another possible
mechanism of ΔAd.AAV1 formation is based on the unique mechanism by which
adenovirus replicates its genome (van der Vliet, B., 1995, In W. Doerfler, et al. (eds.) vol.
2 p. 1-31, Springer-Verlag, Berlin) (see Figure 1). Ad DNA replication is initiated by the
TP/pTP (terminal protein) that binds to specific sites within the ITRs on both ends of the
linear genome and catalyzes, in complex with Ad pol, the binding of the 5' CTP, the first
nucleotide of the daughter strand. DNA synthesis proceeds in a continuous fashion to the
other end of the genome (Figure 1A). Only one of the DNA strands serves as template.
One of the replication products is a single-stranded DNA that circularizes through
annealing of its self-complementary ITRs. The resulting duplex "panhandle" has the
same structure as the termini of the duplex viral genome, which allows the binding of pTP
and the initiation of synthesis of a complementary strand using the single-stranded
"panhandle" molecule as template (Figure 1C). In the case of Ad.AAV1, the Ad pol
synthesizes the single strand of the adenoviral genome starting from the left Ad ITR until
it reaches the second AAV ITR. During synthesis of the second AAV ITR, a certain
percentage of the single-stranded molecules form a loop hybridizing to the
complementary region within the first AAV ITR that was replicated earlier, allowing Ad
pol to use the same viral DNA strand to read back towards the left ITR (Figure 1B). The
resulting "panhandle" structure can be resolved in a similar way as the full-length
intermediate shown in Figure 1C, generating a double stranded, linear molecule with the
above described structure that can be packaged into Ad virions. The ratio of viral DNA
to protein concentration in purified ΔAd.AAV1 particles is comparable to that obtained
from Ad.AAV1 particles. This indicates that despite the smaller size, only one
ΔAd.AAV1 genome is packaged, resulting in particles with a lighter buoyant density
(~1.32 g/cm^3). Electron microscopy demonstrates the icosahedral shape of ΔAd.AAV1
particles (Figure 2). Staining with uranyl acetate causes the central viral cores to appear
electron dense. ΔAd.AAV1 virions have only a spotted luminal dark staining, as expected
with only one 5.5kb genome being packaged per capsid.



B. In vitro ΔAd.AAV1 production:

Characteristics of deleted adeno-AAV vectors (ΔAd.AAV):

A number of experiments are carried out to clarify the mechanisms of ΔAd.AAV genome
formation. Specifically, the presence of two intact AAV ITRs flanking a reporter gene
cassette is required for the effective formation of ΔAd.AAV genomes. This process does
not occur with partially deleted ITRs or oligo-dC and oligo-dG stretches flanking the
expression cassette. Furthermore, in vitro transduction studies are performed with
different genome titers of ΔAd.AAV1, Ad.AAV1, and Ad.AAV1-Δ2ITRs (lacking the
two AAV ITRs) to determine the number of G418 resistant colonies that form after
4 weeks of selection (Table I).

ΔAd.AAV1 is routinely produced at a high titer (5 x 10^12 genomes per ml with >10^4
produced genomes per 293 cell) and at a high purity, with less than 0.1% contaminating
full length Ad.AAV1 genomes, by a technique normally used for amplification and
purification of recombinant adenovirus.

In vitro transduction studies with hybrid vectors on CD34+ cells and erythroleukemia
cells:

In order to test whether the hybrid vectors allow for gene transfer into cell types that
have to be targeted for sickle cell therapy, infection/transduction studies are performed
using CD34+ enriched human bone marrow cells derived from mobilized peripheral
blood, and the human erythroleukemia cell line K562, which expresses ε and γ globin genes.

METHODS 

SKHep1 cells (HTB-52, American Type Culture Collection, Rockville, MD), an
endothelial cell line derived from human liver [Heffelfinger, S.C., et al., 1992, In Vitro
Cell Dev. Biol. 28A, 136-142], are grown in high-glucose Dulbecco's modified Eagle
medium with 10% fetal calf serum. SKHep1 cells are analyzed for integrated AAV
provirus by Southern analysis of genomic DNA using the AAV1 wild type genome
obtained from pAAV/Ad (Samulski, R. J., et al. 1989. Journal of Virology 63:3822-3828)
(gift from David Russell, University of Washington) as a probe. No specific bands are
detected in undigested genomic SKHep1 DNA or after digestion with HindIII. For viral
infection, confluent cells are incubated with different viral doses for 2 hours, followed by
intensive washing. For G418 selection, 24h after infection with ΔAd.AAV1, SKHep1
cells are trypsinized and plated at different dilutions under G418 selection (900µg/ml
active compound, Boehringer-Mannheim, Germany). G418 containing culture medium is
changed every 3 days. The number of colonies with >10^4 cells is counted after 4 weeks
of selection and divided by the number of initially seeded cells. This ratio is used to
express the integration frequency of ΔAd.AAV1. Single colonies are obtained by limiting
dilution of infected cells in 96 well plates. Colonies are expanded to 1x10^6 cells in the
presence of G418. Immunofluorescence analysis for adenoviral proteins expressed in
SKHep1 cells 3 days post-infection is performed as described earlier [Lieber, A., et al.,
1996, J. of Virology, 70, 8782-8791].

RESULTS 

293 cells are infected with the first generation vector Ad.AAV1. During replication of
Ad.AAV1, the small ΔAd.AAV1 genome forms spontaneously and is packaged into
adenovirus capsids. At 36 hours after infection, cells are harvested and virus is released by
several cycles of freeze/thawing. The mixture of Ad.AAV1 and ΔAd.AAV1 particles in
the cell lysate is then separated by ultracentrifugation in a CsCl step gradient. Due to its
lighter buoyant density, the band containing the ΔAd.AAV1 particles is clearly separated
(0.8cm distance) from the band containing full-length virus (Lieber, A., et al. 1999. J
Virol 73:9314-24). ΔAd.AAV1 is purified further by an additional CsCl equilibrium
gradient and is stored in 10mM Tris pH 7.5, 10% glycerol, 1mM MgCl2 at -80°C. In total,
the production of 2x10^13 (genome) particles of ΔAd.AAV1 requires less than 3 hours of
work. All functions for ΔAd.AAV1 replication and particle formation are provided by
Ad.AAV1 genomes amplified in the same cell. The efficiency of vector production
measured on a genome-per-cell basis is comparable to or higher than that of labor-intensive,
newer techniques for rAAV production, which have not yet been proven to be reliable. The
estimated ratio of transducing/genome titer for ΔAd.AAV1 is 1:200 (based on SEAP
expression at day 3 post-infection), whereas for the average rAAV preparation, it is in the
range of 1:10^3 to 1:10^4.

1x10^5 confluent SKHep1 cells are infected with different MOIs
of rAAV1 (stock: 1x10^10 genomes per ml), ΔAd.AAV1 (stock: 5x10^12 genomes per ml),
Ad.AAV1 (stock: 1x10^13 genomes per ml), and Ad.AAV1-Δ2ITRs (stock: 9x10^12 genomes
per ml) in a volume of 100µl. At 24 hours after infection, cells are washed, trypsinized, and
plated at different dilutions. G418 is added 24 hours after plating and selection is
performed for 4 weeks. G418 resistant colonies contain on average >5x10^4 cells (at least
16 cell divisions). A significant number of small colonies visible at 2 weeks post-
infection do not survive continued selection, probably due to episomal vector expression.
Cells infected with first-generation adenoviruses at MOIs greater than 1x10^4 develop
CPE during the first week of selection. The rAAV titer is not high enough to perform
infection studies with MOIs greater than 10^4. Colony formation is expressed as the
percentage of the number of colonies after selection relative to the number of cells initially
seeded for selection (Table I).

TABLE I

Formation of G418 resistant colonies after infection with hybrid viruses in
comparison with rAAV.

MOI             Formation of G418 resistant colonies in % (SEM)
(genomes        (after 4 weeks of selection)
per cell)
                rAAV1         ΔAd.AAV1      Ad.AAV1       Ad.AAV1-Δ2ITRs

10^1            0             0             0             0
10^2            0             0             0             0
10^3            2.7 (1.6)     1.3 (1)       5.4 (3.0)     0
10^4            90.8 (7.0)    48.0 (8.9)    12.9 (7.2)    0
10^5            N/A           93.1 (5.4)    3.8 (2.1)     0
10^6            N/A           100           0             0
10^7            N/A           100           0             0

N = 3 (SEM is indicated in parentheses.)
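
As an illustration of how an entry of the form "mean (SEM)" with N = 3 could be derived, the
following Python sketch converts colony counts into percentages and computes the mean and the
standard error of the mean. The replicate counts are hypothetical and are not data from
Table I; the function names are assumptions introduced here.

```python
import math
import statistics


def colony_formation_percent(colonies: int, cells_seeded: int) -> float:
    """Colonies after selection as a percentage of cells initially seeded."""
    return 100.0 * colonies / cells_seeded


def mean_and_sem(values):
    """Return the mean and the standard error of the mean (sample SD / sqrt(n))."""
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(len(values))
    return mean, sem


# Hypothetical triplicate wells: (colonies counted, cells seeded).
replicates = [(45, 100), (60, 100), (39, 100)]
percents = [colony_formation_percent(c, s) for c, s in replicates]

mean, sem = mean_and_sem(percents)
print(f"{mean:.1f} ({sem:.1f})")
```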



K562 cells are infected with different MOIs oftsAdJLA VBG (1-lCP eenomes per cell): 

Three days after infection, the total number of viable cells (based on Trypan blue 
staining) and the percentage of infected cells (based on X-Gal staining) are determined 
for all MOIs. The results are presented in Figure 4A. 

Initial integration studies: 

K562 cells are incubated with AAd.AAVSNori at an MOI of 2x1 0 s genomes per cell and 
the colonies that formed after 4 weeks of G418 selection are counted in 96 well plates. 
G418 resistant colonies contain on average >5x10^4 cells, which means that the original
cell underwent at least 16 cell divisions.
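
The figure of at least 16 divisions follows from the colony size if each colony is assumed to
arise from a single cell that doubles at every division. A minimal Python sketch of that
arithmetic (the function name is an assumption introduced here):

```python
import math


def min_divisions(cells_in_colony: float) -> int:
    """Smallest n with 2**n >= cells_in_colony, for a colony grown from one cell."""
    return math.ceil(math.log2(cells_in_colony))


# A colony of 5x10^4 cells lies between 2**15 (32,768) and 2**16 (65,536) cells.
print(min_divisions(5e4))
```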

Infection s tudies with .AdAAVBG fl-ltf genomes per cell) on CD34+cells: 

Infection of CD34+ cells is performed as described for K562 cells. CD34+ cells are cultured
in IMDM supplemented with 20% FCS, kit ligand (stem cell factor, SCF) (100ng/ml), and
IL-3 (100ng/ml). Since a number of reports suggest that specific cytokines such as GM-CSF
or M-CSF, which induce stem cell differentiation, can stimulate integrin expression and
may therefore affect internalization of Ad5 vectors, infection rates of
Ad5 based hybrid vectors are compared on CD34+ cells cultured with and without
pre-stimulation with GM-CSF (50ng/ml) or M-CSF (50U/ml). The number of infected cells is
counted based on X-Gal staining at day 3 after infection. To test for dose-dependent
toxicity, viable cells are counted (based on trypan blue exclusion) at day 3 post-infection.
Furthermore, whether high viral doses affect the ability of CD34+ cells to differentiate in
methyl cellulose colony assays in the presence of IL-3 and SCF is analyzed. The results are
expressed as viable cells/X-Gal positive cells vs MOI (see Figure 4B).

DISCUSSION 

The above data demonstrate that ΔAd.AAV stably transduces an immortalized human
cell line with a low frequency comparable to that of rAAV. However, transduction rates can
be scaled up to 100% by using greater MOIs of ΔAd.AAV1, which is produced at higher
titers than rAAV1. In contrast to infection with the first-generation vector, Ad.AAV1,
infection with ΔAd.AAV1 is not associated with dose-dependent cytotoxicity because no
viral proteins are expressed from these vectors in transduced cells. Furthermore, viral
proteins present in the incoming ΔAd.AAV1 particles are not problematic in the dose
range used. The comparison of transduction rates of ΔAd.AAV1/Ad.AAV1 with the vector
lacking AAV ITRs, Ad.AAV1-Δ2ITRs, supports the hypothesis that the presence of two
intact AAV ITRs is crucial for hybrid vector integration.

The data demonstrate that the leukemia cell line can be infected at ~90% efficiency with
Ad5 based hybrid vectors at MOIs of 2x10^5 genomes per cell without significant toxic side
effects. However, this dose is still ~100 times greater than the dose necessary to infect
100% of HeLa cells, hepatoma cells, primary hepatocytes and other cell lines generally
considered as permissive for Ad5 vector infection.

Since viral DNA in cells infected with 2x10^4 genomes (or 100 transducing particles per
cell) should be lost after 7 cell divisions, the presence of G418 resistant cells in the
observed colonies suggests that ΔAd.AAVSNori genomes are integrated into or stably
associated with the host genome. Based on the number of G418 resistant colonies, one
out of 25,000 ΔAd.AAVSNori genomes integrates stably into K562 cells. This is in
agreement with the results obtained earlier with ΔAd.AAV1 in SKHep1 cells.
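
The estimate that episomal vector DNA is lost after 7 divisions can be reproduced with a
simple dilution model. The sketch below assumes, as a simplification not stated in the text,
that non-replicating genomes are split evenly between daughter cells at each division, and it
uses the 1:200 transducing-to-genome ratio reported above; the function name is hypothetical.

```python
def divisions_until_lost(copies_per_cell: float) -> int:
    """Divisions needed for the mean copy number per cell to fall below one,
    assuming copies halve at each division and do not replicate."""
    divisions = 0
    while copies_per_cell >= 1:
        copies_per_cell /= 2
        divisions += 1
    return divisions


genomes_per_cell = 2e4          # MOI stated in the text
genomes_per_transducing = 200   # 1:200 transducing/genome ratio from the text
transducing_per_cell = genomes_per_cell / genomes_per_transducing

print(transducing_per_cell, divisions_until_lost(transducing_per_cell))
```

Under these assumptions, 100 transducing particles per cell dilute to fewer than one copy per
cell well before the 16 or more divisions needed to form a G418 resistant colony.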

The maximal dose used for infection of CD34+ cells (1x10 s ) results in X-Gal staining of 
only ~10% of cells, independently of GM-CSF/M-CSF. This demonstrates the obvious
inability of Ad5 to infect CD34+ cells and is probably caused by the absence of specific
receptors and/or integrins on the cell surface. CD34+ cells tolerate a large range of viral
doses (1-10^7) without obvious effects on cell viability and total cell number. This is not
surprising because, in order to develop toxic side effects, adenovirus has to enter the cell
and express viral genes. Hybrid vectors can be produced at titers of 5x10^12 genomes per
ml. Thus, the maximal MOI that can be used for infection (of 10^4 cells) is ~5x10^8 (in
100µl storage buffer). Based on the infection studies with ΔAd.AAVBG, this dose may
not be sufficient to efficiently transduce CD34+ cells and to obtain an appreciable
number of G418 resistant colonies.

C. In Vivo Properties Of ΔAd.AAV1:

Viral DNA is labeled with BrdU during virus amplification to investigate cellular/nuclear
vector uptake in situ. For transduction studies, confluent SKHep1 cells (a human
endothelial cell line) are infected with 2000 genomes of ΔAd.AAV1 or Ad.AAV1 per cell.
BrdU tagged viral DNA is detected in 100% of nuclei at 3 hours post-infection for both
viruses, indicating efficient cellular and nuclear uptake of hybrid virus DNA.

RESULTS 

The ΔAd.AAV1 vector transduces cells in vitro, forming G418 resistant colonies with an
efficiency of 17 or 58% after infection with an MOI of 1x10^3 or 1x10^4 genomes per cell,
respectively. Approximately 2x10^4 ΔAd.AAV1 genomes are required to yield one stable
transfectant. Since all stable colonies contain integrated ΔAd.AAV1 vector DNA, this
number reflects the minimal integration frequency of ΔAd.AAV1 in SKHep1 cells, which
is comparable with that of rAAV (Rutledge, E. A. et al., 1997, Journal of Virology,
71:8429-36). The number of G418 resistant colonies does not necessarily represent the
total frequency of integration events because not all integrated copies express neomycin
phosphotransferase, due to chromosomal position effects or incomplete integration.
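
One simple way to relate the colony-forming efficiencies above to a number of genomes per
stable colony is to divide the MOI by the fraction of seeded cells that form colonies. This
is an illustrative estimate under that stated assumption, not necessarily the calculation used
by the authors; the function name is hypothetical.

```python
def genomes_per_stable_colony(moi: float, colony_fraction: float) -> float:
    """Genomes applied per cell divided by the fraction of cells forming colonies."""
    if not 0 < colony_fraction <= 1:
        raise ValueError("colony_fraction must be in (0, 1]")
    return moi / colony_fraction


# MOI and colony-forming efficiency pairs taken from the paragraph above.
for moi, fraction in [(1e3, 0.17), (1e4, 0.58)]:
    estimate = genomes_per_stable_colony(moi, fraction)
    print(f"MOI {moi:.0e}: about {estimate:.0f} genomes per stable colony")
```

At the higher MOI this estimate is of the same order as the approximately 2x10^4 genomes
stated in the text.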

The absence of adenoviral gene products in ΔAd.AAV1 transduced cells at day 3 post-
infection is demonstrated by immunofluorescence with antibodies to the major late
proteins (hexon, fiber) and early proteins (DBP, E4-orf6). Expressed adenoviral proteins
are detected only in cells infected with Ad.AAV1. The fact that cells infected with
ΔAd.AAV1 do not express potentially cytotoxic adenoviral proteins is important.

While an MOI of 1x10^4 genomes per cell of the first generation vector Ad.AAV1 induces
cytopathic effects in SKHep1 cells at day 3 p.i., no toxic side effects are observed when
SKHepl cells are infected with AAd.AAVl at a dose of up to 1x10 s genomes per cell. 
Since the transduction efficiency is clearly dose dependent, ΔAd.AAV1 (which can be
produced at titers of >5x10^12 genomes/ml) is able to stably transduce Ad5 permissive cell
lines or tissues with 100% efficiency without associated toxicity.

Southern analysis indicates that ΔAd.AAV1 integrates randomly as head-to-tail tandem
repeats into the host cell genome via the right AAV ITR, whereas the other junction with
the chromosomal DNA is variable and occurs somewhere within the transgene cassette.
In order to confirm the integrated status of ΔAd.AAV1 DNA, high-molecular-weight
chromosomal DNA is separated by pulsed-field gel electrophoresis (PFGE), followed by
Southern analysis with a SEAP specific probe (Figure 3). Undigested DNA from control
SKHep1 cells gives an endogenous SEAP signal that co-migrates with chromosomal DNA
just below the well (lanes 1 and 5). No high-molecular-weight episomal forms of
ΔAd.AAV1 DNA are detected, whereas a distinct 35 kb band is visible in DNA from
SKHep1 cells isolated 3 days after infection with the first generation adenovirus Ad.AAV1
(lanes 4 and 13). Digestion with EcoRI reveals the 4.4kb fragment, which is specific for
integrated tandem copies of the AAV cassette (lanes 8 and 12). To eliminate the
possibility that chromosomal DNA is trapped in the well, DNA samples are digested with
the intron-encoded endonucleases PI-SceI or I-CeuI (Gibco-BRL, Grand Island, NY), which
have a sequence specificity of more than 11 bp or 9 bp, respectively. Digestion with PI-SceI
yields a >2 Mb endogenous SEAP signal in SKHep1 cells (lane 2) and an additional signal
in the range of ~1 Mb in G418 resistant colonies transduced with ΔAd.AAV1 (lane 7).
I-CeuI digestion results in a smear between 250-1000 kb in ΔAd.AAV1 transduced
SKHep1 cells (lanes 10, 11), indicating random integration, whereas a high-molecular-
weight band specific for the endogenous SEAP gene is observed in control SKHep1 cells
(lane 9).

One day after intraportal infusion of 1x10^12 ΔAd.AAV1 genomes in C57Bl/6 mice, BrdU
labeled vector genomes can be detected in 85% of hepatocytes (Lieber, A., et al. 1999. J
Virol 73:9314-24). Hepatocellular DNA analysis performed at 2 months post-infusion
reveals ΔAd.AAV1 DNA integrated with an average of 0.5 copies per cell into the mouse
genome (Lieber, A., et al. 1999. J Virol 73:9314-24). To assess potential side effects of
intraportal ΔAd.AAV1 infusion, serum glutamic pyruvic transaminase (SGPT), a
sensitive marker for hepatocellular injury, is measured for 7 consecutive days post-
infusion in combination with histological analysis of liver sections. No significant
elevation in SGPT levels or histological abnormalities are detected after intraportal
infusion of 1x10^12 or 1x10^13 ΔAd.AAV1 genomes, whereas infusion of the same dose of
full-length Ad.AAV1 vector is associated with severe hepatotoxicity or fatal outcome. This
suggests that the dose of ΔAd.AAV1 administered to mice can be increased to obtain
higher transduction efficiencies in vivo without adverse side effects, which is not possible
for first generation adenoviruses. Importantly, ΔAd.AAV1 transduces quiescent
hepatocytes in vivo, which suggests that integration of hybrid vector DNA may not
require cell proliferation. Recently, more detailed in vivo transduction studies with
Ad.AAV1 and ΔAd.AAV1 have been performed in Balb/c mice to study whether the
absence of adenoviral gene expression in cells infected with ΔAd.AAV1 can avoid an
anti-viral immune response and can prolong vector persistence. In this mouse strain,
vector DNA is cleared from the liver at 4-6 weeks after infusion with first generation
adenoviruses, mostly due to a CTL response against viral proteins produced in transduced
cells. Vector DNA is analyzed by genomic Southern blot of hepatic DNA at 12 weeks
after infusion of 1x10^12 genomes of Ad.AAV1 or ΔAd.AAV1. At this time point, no vector
specific signal is detectable in hepatic DNA from mice infused with the first generation
vector Ad.AAV1, while ~0.3 copies of ΔAd.AAV1 genomes per cell are present in livers
of mice that received the hybrid vector, again indicating the superior in vivo properties of
the hybrid vector.

D. Effects of Rep Coexpression on ΔAd.AAV Integration

Rep expression after plasmid transfection:

In order to test whether Rep expression enhances site-specific integration of ΔAd.AAV1
in human cells, a series of Rep expression plasmids is constructed.

METHODS 

The Rep ORF 68/78 (nt 285-2313), including the internal p19 and p40 promoters, is
obtained from pAAV/Ad (Samulski, R. J. et al., 1991, In B. N. Fields, et al. (eds.),
Fields Virology, vol. 2, Lippincott-Raven Publisher, Philadelphia) by digestion with
BsaI/BsrI. This fragment, deleted for the AAV p5 promoter, is cloned via adapter linkers
under the RSV or PGK promoter in front of the bovine growth hormone polyadenylation
signal (bPA) into pAd.RSV or pAd.PGK (Lieber, A., and Kay, M.A., 1996, J. of
Virology, 70, 3153-3158; Lieber, A., et al., 1995, Human Gene Therapy, 6, 5-11),
respectively.

RESULTS 

When the resulting plasmids (pRSVrep, pPGKrep) are transfected into 293 cells or SKHep1
cells, most of the Rep proteins expressed from the heterologous promoters (RSV or PGK)
are Rep 68 and Rep 78, while transfection of the rep gene under the AAV p5 promoter
(pAAV/Ad) results in predominant Rep 52/40 expression. Thus, Rep expression after
transfection of pRSVrep and pPGKrep is more pronounced in 293 cells, suggesting a strong
transactivation of AAV promoters by E1a, which is produced in 293 cells. This result
indicates that minimal expression of Rep proteins is necessary to avoid interference
with adenovirus replication.

Rep-mediated site-specific integration of ΔAd.AAV1:

The potential for site-specific integration is an important characteristic of the novel
Ad.AAV vectors of the invention. In an embodiment of the invention, integration of the
ΔAd.AAV is directed by co-infection with Ad.AAV expressing the rep 78 protein to
achieve site-specific integration in the AAVS1 site on human chromosome 19. For this
type of site-specific integration to occur in cells other than 293 cells, E4 ORF6
expression is required. The co-infection of ΔAd.AAV, ΔAd.rep78, and ΔAd.E4-orf6
allows for site-specific integration of the ΔAd.AAV transgene cassette. The ΔAd.rep78
and the ΔAd.E4-orf6 genomes are degraded soon after transduction, thus avoiding
potential side effects. Site-specific integration is preferred over random integration,
which is seen with rAAV and ΔAd.AAV, in order to reduce the risk of insertional
mutagenesis.

A preliminary test can be performed to confirm the functional activity of Rep 68/78
expressed from pRSVrep to mediate site-specific integration of ΔAd.AAV1 (Figures 5 and
6). Human SKHep1 cells are transfected with pRSVrep or control plasmid (pRSVbGal;
Lieber, A., et al., 1995, Human Gene Therapy, 6, 5-11) (transfection efficiency was
~20%), followed by infection with ΔAd.AAV (2000 genomes per cell). Three days after
infection, cells are trypsinized, embedded in agarose, lysed in situ, digested with I-CeuI
(an intron-encoded endonuclease with a recognition sequence of more than 10 nt),
subjected to pulsed-field gel electrophoresis (PFGE) in a 1% agarose gel, and analyzed by
Southern Blot. Hybridization with a probe covering the AAVS1 integration site (1.7kb
EcoRI/BamHI fragment from the chromosome 19 locus (Samulski, R. J. et al., 1991, In
B. N. Fields, et al. (eds.), Fields Virology, vol. 2 Lippincott-Raven Publisher,
Philadelphia)) reveals an AAVS1-specific band (~240kb) in I-CeuI digested DNA from
cells after control plasmid transfection (pCo) + ΔAd.AAV1 infection. An additional
signal in the range of 280kb appears in rep expressing cells infected with ΔAd.AAV1
(pRSVrep + ΔAd.AAV1), indicating a site-specific insertion into the AAVS1 site in a
certain percentage of cells. The presence of vector DNA in this 280kb band is confirmed
by rehybridization of the same filter with a transgene (SEAP) specific probe. Randomly
integrated ΔAd.AAV1 vector appears as a diffuse SEAP signal in the range 280-680kb
(pCo+ΔAd.AAV1, pRSVrep+ΔAd.AAV1). The specific ~1.9 Mb band on blots
hybridized with the SEAP probe represents an I-CeuI fragment containing the
endogenous human SEAP gene.

Incorporation of Rep 68/78 function into hybrid vectors to stimulate site-specific integration:

Rep overexpression inhibits adenovirus DNA replication, prohibiting the generation of
rep expressing Ad vectors using conventional strategies. To solve this problem,
significant Rep 68/78 expression from the hybrid vector in virus producer (293) cells
must be prevented while maintaining transient Rep expression in target cells (HSC) to
mediate site-specific integration. Our hypothesis is that the specific structure of the
ΔAd.AAV hybrid virus can be used to bring the rep 68/78 gene into a transcriptionally
active position under control of a HSC specific promoter only at late stages of virus
replication in 293 cells. This will allow amplification of the hybrid vector in 293 cells,
generating high titer virus which activates the incorporated Rep 68/78 functions only in
HSC. The general outline of our strategy to produce Rep expressing hybrid vectors is
illustrated in Figure 7. The rep/transgene cassette is assembled based on the left-hand
shuttle plasmid used for recombinant adenovirus production. The gene encoding Rep
68/78 is cloned in 3'→5' orientation in front of a transgene expression cassette flanked by
AAV ITRs. Between the transgene cassette and the right AAV ITR an HSC-specific
promoter is inserted with direction towards the adenoviral E2, E3, and E4 genes. The
recombinant genome is produced by recombination in E. coli, and transfection into 293
cells generates virus (Ad.AAV-rep). The specific structure of ΔAd.AAV with duplicated
sequences flanking the AAV ITRs is used to bring the rep gene into a transcriptionally
active position under control of a HSC specific promoter only during late stages of viral
DNA replication in 293 cells. During amplification of Ad.AAV-rep, the smaller genome
ΔAd.AAV-rep is formed and packaged into particles, which can be separated by
ultracentrifugation in CsCl gradients. The specific structure of ΔAd.AAV-rep brings the
rep gene into 5'→3' orientation in relation to the HSC specific promoter, allowing rep
transcription in target cells. After transduction of HSC with purified ΔAd.AAV-rep
particles, rep expression is activated and mediates rescue of the AAV-ITR/transgene
cassette from the adenoviral vector backbone and site-specific integration. The
hypothesis is that Rep-mediated integration into AAVS1 occurs via the right or both
AAV ITRs, causing the rep gene to become separated from the hepatocyte-specific
promoter once the vector is integrated (Figure 7). Therefore, rep expression should be
only transient without critical cytotoxic side effects on the host cell.

Promoters that can regulate rep expression:

Potential candidate promoters to drive rep expression with high specificity for HSC and
minimal activity in 293 cells are the 454nt CD34 promoter (Krause, D.S., et al., 1997,
Experimental Hematology, 25, 1051-1061; Yamaguchi, Y. et al., 1997, Biochimica et
Biophysica Acta, 1350:141-6), the 300nt HS 40 enhancer (Chen, H.L., et al., 1997,
Nucleic Acids Res. 25, 2917-2922) or a 3kb CD34 enhancer (May, G. et al., 1995, EMBO
J., 14:564-74) in combination with an initiator, or the HIV LTR. An optimal promoter is
selected based on studies of transient reporter gene expression after plasmid transfection
in 293 cells and hepatocytes. All promoters to be tested are cloned in front of the human
α1-antitrypsin (hAAT)-bovine growth hormone polyadenylation signal (bPA) into the
adenoviral shuttle plasmid pCD2 (pAd.-hAAT). Promoter activity can be tested in
transient plasmid transfection assays in CD34+ and 293 cells. The promoter with the
highest hAAT levels in CD34+ or K562 cells and the lowest hAAT expression in 293
cells is selected for further studies. If high background expression in 293 cells from these
promoters is seen, insulators can be utilized to shield HSC-specific promoters from the
E1a enhancer, which is still present in Ad shuttle plasmids.

Rep genes:

The large Rep 68/78 proteins are sufficient to mediate rescue and site-specific
integration. Unregulated Rep 52 and Rep 40 expression from the AAV p19 promoter
located within the ORF of Rep 68 and 78 must be prevented because production of these
smaller Rep proteins in 293 cells will affect cell viability and adenoviral DNA synthesis.
To do this, constructs obtained from Surosky et al., containing a mutated Rep 52/40 start
codon to express Rep 68 and 78 individually under the CMV promoter, can be used. The 293
cells transiently expressing Rep 68 or Rep 78 from these constructs can be coinfected with
ΔAd.AAV1 (infection 24 hours after pCMVRep transfection, MOI 2x10^5 genomes/cell).
Three days after ΔAd.AAV1 infection, cellular DNA is analyzed for AAVS1-specific
integration events by PCR and PFGE as described earlier. Efficient Rep mediated
excision of the AAV cassette and site-specific integration without flanking adenoviral
sequences are expected, and the plasmids pCMVRep68 or pCMVRep78 can be used as a
source for the corresponding rep genes, which are then cloned into hybrid vectors.

Vectors: 

The rep/transgene cassette can be assembled based on pXCJL (Microbix, Toronto). A set
of control hybrid vectors can be generated with the AAV-ITR-transgene cassette only,
without the rep gene. The recombinant Ad.AAV-rep genome can be generated by
recombination of the left hand shuttle plasmids with pCD1, a pBHG10 (Microbix,
Toronto) derivative, which contains the Ad5 genome deleted for the E1/E3 regions, in
recA+ E. coli (Chartier, C., et al., 1996, J. of Virology, 70, 4805-4810). Compared to the
standard technique based on plasmid recombination in 293 cells, this approach has the
advantage that plaques with recombinant virus appear 3 times faster and the production
of illegitimate recombinants is minimized. This allows efficient viral DNA amplification
and packaging to occur before Rep expression reaches levels that are potentially
inhibitory for adenoviral replication. The critical variables in maximizing the output of
the vector deleted for all adenoviral genes are the initial multiplicity of infection and
the time of harvesting. These parameters can be optimized for production of ΔAd.AAV-rep
hybrid vectors. A number of ΔAd.AAV vectors can be constructed incorporating the rep
gene. Cryptic promoter and enhancer elements present in the 5'-342nt of the adenoviral
genome can interfere with transgene expression from the heterologous promoters. This is
crucial for the strategy to avoid rep expression from ΔAd.AAV-rep genomes in 293 cells.
To ensure efficient transgene expression, insulator fragments such as the chicken
beta-globin insulator can be used with a selected promoter, constitutive or inducible.

Rep protein co-packaging:

As an alternative to producing hybrid vectors containing the rep 68/78 gene, studies are
designed to see whether Rep protein can be co-packaged into ΔAd.AAV capsids and
whether these co-packaged Rep molecules are sufficient to mediate rescue and
site-specific integration of the AAV-ITR-transgene cassette. Our hypothesis is that Rep
68/78 binds to the Rep binding site (RBS) present in the double-stranded ΔAd.AAV genome
and that this complex is co-packaged into adenoviral capsids, which are spacious enough
to accommodate extra proteins. Protein/DNA ratio analysis performed previously on
purified particles indicates that only one 5.5kb ΔAd.AAV1 genome is packaged per
capsid. This is confirmed by electron microscopy of ΔAd.AAV1 particles, which reveals
only spotted electron-dense staining associated with viral cores and extended free luminal
space (see Figure 2).

293 cells are transfected with plasmids expressing Rep 68/78 under the CMV promoter
and the kinetics of rep expression is determined by Western Blot with cell lysates
collected at different time points after transfection. Next, these 293 cells are infected
with Ad.AAV (MOI 1, 10, 100 pfu/cell) at specific time points after transfection of Rep
plasmids depending on the Rep expression kinetics (e.g. 3, 6, 12, 24 . . . hours after
transfection). It is important to time Ad.AAV infection exactly because viral DNA
replication must be taking place or finished before Rep production reaches peak levels.
In general, adenovirus DNA replication in 293 cells (infected with MOI 10) is maximal at
18 hours post-infection, followed by production of structural proteins, packaging of viral
genomes, and breakdown of cellular membrane structures (which is concluded ~36-48h
p.i.) (Shenk, T., 1996, In B. N. Fields, et al. (eds.), Fields Virology, vol. 2
Lippincott-Raven Publisher, Philadelphia; van der Vliet, B., 1995, In W. Doerfler, et al.
(eds.) vol. 2 p. 1-31, Springer-Verlag, Berlin). Viruses are collected 48h after infection
and banded by CsCl ultracentrifugation. Viral material from purified bands corresponding to
ΔAd.AAV is lysed, DNase-treated (to liberate DNA-associated Rep) and subjected to
immunoprecipitation-Western Blot with Rep specific antibodies to detect co-packaged
Rep. Based on theoretical calculations assuming that two Rep molecules bind per Ad
genome, ~1-10 ng of Rep protein is expected from lysates of 10^10 particles, which is within
the range of detectability by Western Blot. Alternatively, co-packaged Rep may be
detected based on its functional activity to mediate rescue and site-specific integration of
the AAV-ITR-transgene cassette. To test whether functional Rep protein is co-packaged
into hybrid vector particles, CsCl purified ΔAd.AAV1 particles generated in 293 cells
co-expressing Rep after Ad.AAV1 infection (ΔAd.AAV1+Rep) can be used for transduction
studies. Three days after ΔAd.AAV1+Rep infection of the human cell line K562, cellular
DNA is analyzed for AAVS1-specific integration events by PCR and PFGE. If efficient
Rep-mediated site-specific integration of excised AAV cassettes is successful, then other
ΔAd.AAV+Rep hybrid vectors with β-Gal and SNori as transgenes can be produced.
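
The detectability estimate for co-packaged Rep given above can be checked with a short
calculation. The following is only a sketch: it takes the two Rep molecules per genome
from the text, and it assumes a molecular mass of about 78 kDa for Rep 78 (a round value
suggested by the protein's name, not a figure from this specification). The function and
parameter names are illustrative.

```python
AVOGADRO = 6.022e23  # molecules per mole


def rep_mass_ng(particles, rep_per_genome=2, rep_kda=78.0):
    """Estimate the mass (in ng) of co-packaged Rep in a particle lysate.

    particles      -- number of viral particles lysed (one genome per particle)
    rep_per_genome -- Rep molecules bound per genome (2, as assumed in the text)
    rep_kda        -- assumed molecular mass of Rep in kDa (hypothetical round value)
    """
    molecules = particles * rep_per_genome
    grams = molecules / AVOGADRO * rep_kda * 1000.0  # kDa -> g/mol
    return grams * 1e9  # g -> ng


if __name__ == "__main__":
    print(f"{rep_mass_ng(1e10):.1f} ng Rep expected from 1e10 particles")
```

Under these assumptions the estimate for 10^10 particles comes out at roughly 2.6 ng,
which falls inside the 1-10 ng range stated above.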

Integration studies with Rep vectors in erythroid cells:

The hypotheses behind the rationale for a rep-expressing hybrid vector (ΔAd.AAV-rep)
are: (1) transient Rep co-expression from ΔAd.AAV-rep vectors can enhance site-specific
vector integration in human cells and (2) integration occurs via the AAV ITR(s)
without the rep gene, which is placed outside the AAV cassette, thus eliminating rep
expression upon vector integration. To test hypothesis 1, transduction frequencies of
ΔAd.AAV/rep versus ΔAd.AAV vectors can be compared based on the formation of
G418 resistant colonies, and site-specific integration events can be quantified at
different time points after infection of human and mouse cells by PFGE and PCR. To test
hypothesis 2, the structure of integrated vector in transduced cell populations and single
clones can be delineated by Southern analysis and by sequencing of vector/chromosomal DNA
junctions. These studies can be performed with ΔAd.AAV-rep, ΔAd.AAV, and
ΔAd.AAV+Rep (copackaged protein) in human K562 or HEL (for AAVS1 integration)
and mouse MEL cell lines.

Cells infected with ΔAd.AAV-SNori, ΔAd.AAV-SNori+Rep or ΔAd.AAV-SNori-rep can
be subjected to G418 selection. The number of G418 resistant colonies is determined after
4 weeks of selection in relation to the number of initially infected cells. The selection
process can be monitored for colonies that do not survive continued selection due to
potential rep-mediated cytotoxicity or episomal vector expression. If rep expression from
ΔAd.AAV-SNori-rep does not affect cell viability and proliferation, then more G418
resistant colonies should appear with ΔAd.AAV-SNori-rep and ΔAd.AAV-SNori+Rep. The
structure of integrated vector can be determined by Southern Blot and sequencing of
integration junctions.

To uncover a potential selection bias against rep producing cells after transduction with
ΔAd.AAV/rep, site-specific and random vector integration events can be quantitated in
cellular DNA isolated from cell populations at different time points after infection (e.g.
0.5, 1, 3, 7, 14 days). To do this, the techniques based on PFGE-Southern can be utilized.
It is expected that the signal(s) for AAVS1-specific integration in ΔAd.AAV/rep infected
human cells increases during the first days after infection and then remains constant over
time.



In a separate study, the integration status of vector DNA (analyzed by PFGE or PCR) and
the number of integrated copies (analyzed by Southern Blot) can be correlated with the
expression level of β-galactosidase in single clones transduced with β-Gal hybrid vectors
(ΔAd.AAV-BG, ΔAd.AAV-BG+Rep, or ΔAd.AAV-BG-rep). Together with data
obtained in the studies described in the Specification, this allows assessment of whether
transcriptional silencing is associated with site-specific vector integration into the
AAVS1 site.

It is not clear a priori whether the specific Rep function for vector rescue,
concatemerization, and integration can efficiently occur in non-S-phase or non-dividing
cells. To test whether ΔAd.AAV*, ΔAd.AAV or ΔAd.AAV-rep/+Rep vectors can
integrate into non-dividing cells, transduction studies in cell cycle arrested cell cultures
can be performed as described earlier.
DISCUSSION

The establishment of stable cell lines expressing Rep 68/78 at detectable levels is not 
possible, which is probably due to rep mediated cytotoxicity. Therefore, it is not possible 
to perform long-term transduction studies (e.g. G418 selection or studies in single clones) 
in combination with ectopic rep expression. Moreover, due to the inhibitory effect of rep 
on adenovirus replication, it is currently not possible to generate adenoviral vectors 
expressing rep under the RSV or PGK promoter.

Taken together, this indicates that co-expressed Rep may stimulate site-specific transgene 
integration. 

E. A Detailed Study Of Transduction/Integration Of Hybrid Vectors In Erythroid 
Cell Lines: 

In order to improve transduction and integration frequencies of the hybrid vectors in
erythroid cell lines, a detailed study comparing various hybrid vectors has to be carried
out as described below. The transduction studies are performed in K562 cells, which are
considered to be an adequate model to study gene transfer vehicles into erythroid cells
(Floch, V., et al., 1997, Blood Cells, Mol. and Diseases, 23, 69-87). The optimal vectors
should be able to integrate into the cellular genome with a high frequency, determined by
pulsed-field gel electrophoresis (PFGE) and Southern blot as described in Example 4. In
addition, the results from the following studies will serve to evaluate whether a given
hybrid vector needs to be modified for site-specific integration in the host genome.

Sequencing of integration junctions: 

The ultimate proof of vector integration is the sequencing of junctions between SNori
vector DNA and chromosomal DNA. Furthermore, this clarifies the question of whether the
AAV ITRs represent the substrate for integration. Specifically, DNA from clones with
known ΔAd.AAVSNori integration structure (analyzed by Southern Blot) is digested with
EcoRI, which does not cut within the SNori cassette. The resulting fragments are
circularized and transformed into a specific E. coli strain (according to the protocol
described by Rutledge and Russell (Rutledge, E. A. et al., 1997, Journal of Virology,
71:8429-36)). Kanamycin resistant bacterial clones should contain the integrated SNori
cassette. Flanking chromosomal DNA in rescued plasmids can be sequenced with
primers specific to the transgene.

To confirm vector integration in a small number of transduced cells, genomic DNA is
extracted and digested with EcoRI. EcoRI fragments are ligated to linkers containing a
specific primer binding site and are then digested with NotI, religated and propagated in
E. coli. Plasmid DNA from a representative number of bacterial clones is sequenced to
determine the vector/chromosomal DNA junctions.

Dose dependent toxicity:

In order to test whether the transduction frequency is dose-dependent and whether
ΔAd.AAV vectors, which are devoid of all adenoviral genes, can be used to infect cells at
higher doses with less cytotoxicity than first generation adenovirus, K562 cells are
infected with different MOIs (1-10^8) of ΔAd.AAVBG and the first generation vector
Ad.AAVBG (which contains the same β-Gal expression cassette). At day 4 post-infection,
the total number of cells, the percentage of viable cells (based on trypan blue exclusion)
and the percentage of X-Gal positive cells are counted. A fraction of the infected cells is
assayed for β-Gal expression using the Galacto-Light kit. The level of transgene
expression is expected to be comparable between the two vectors. K562 cells are predicted
to tolerate higher doses of ΔAd.AAVBG better than Ad.AAVBG, which expresses viral genes.

Integration frequency with and without G418 selection: 

In order to investigate the integration frequency of the different vectors and to confirm
that AAV ITRs present in double-stranded adenoviral DNA genomes can mediate vector
integration with a frequency comparable to rAAV vectors, integration studies are
performed based on the formation of G418 resistant colonies with ΔAd.AAVSNori,
AdSNori, Ad.SNoriITR, and rAAVSNori after infection with 2x10^5 and 2x10^6 genomes
per cell (Fig. 8). After infection, cells are plated in 96 well plates under limiting dilution
and selected with G418 to estimate the frequency of formation of G418 resistant colonies.
Another set of cells is plated without G418. A representative number of clones (w/ and
w/o G418 selection) are expanded to >10^6 cells (after 3-4 weeks of culture) and analyzed
for the presence of viral DNA by Southern Blot as well as PFGE analysis to discriminate
between episomal vector DNA and vector genomes stably associated with chromosomal
DNA. This allows us to estimate the integration frequency of the different vectors, to
assess the effect of G418 selection on integration, and to consider position effects on neo
expression in calculating the total integration frequency. Integration of vector copies
with a frequency of at least 1x10^-4 is predicted only for ΔAd.AAVSNori and rAAVSNori.
The total number of colonies may be lower for both first generation vectors,
Ad.AAVSNoriITR and Ad.SNori, due to the toxic effects of expressed adenoviral
proteins; however, a higher integration frequency is predicted for the vector containing
the AAV ITRs (Ad.AAVSNoriITR).
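
One common way to turn limiting-dilution plate counts into a colony-forming frequency is a
single-hit Poisson estimate. The specification does not state which estimator is used, so
the sketch below is an assumption for illustration only; the function name and its
arguments are hypothetical.

```python
import math


def colony_forming_frequency(negative_wells, total_wells, cells_per_well):
    """Single-hit Poisson estimate of the colony-forming frequency per plated cell.

    The fraction of wells without a G418 resistant colony is modelled as
    exp(-f * cells_per_well), so f = -ln(fraction_negative) / cells_per_well.
    """
    if total_wells <= 0 or cells_per_well <= 0:
        raise ValueError("total_wells and cells_per_well must be positive")
    if not 0 < negative_wells <= total_wells:
        raise ValueError("need at least one negative well and negative <= total")
    fraction_negative = negative_wells / total_wells
    return -math.log(fraction_negative) / cells_per_well


if __name__ == "__main__":
    # Placeholder plate counts for illustration only, not experimental data.
    f = colony_forming_frequency(negative_wells=87, total_wells=96, cells_per_well=1000)
    print(f"estimated frequency: {f:.2e} colonies per plated cell")
```

With the placeholder counts shown, the estimate is close to 1x10^-4, the order of magnitude
discussed above. The estimate is undefined when every well is positive, which is why a
dilution series is plated.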

Kinetics of integration: 

Compared to rAAV, the double-stranded nature of entering ΔAd.AAV genomes provides
more protection against degradation. Furthermore, the synthesis of transcriptionally
active double-stranded intermediates from single-stranded genomes, which is considered
a limiting step in rAAV transduction, is not required in ΔAd.AAV transduction. Thus,
the lag phase between infection and expression seen with rAAV vectors, which is
causally linked to double-strand synthesis/integration, may be shorter or absent in
infections with ΔAd.AAV vectors. Furthermore, it was demonstrated earlier that a 9kb
mini-adenoviral genome packaged into adenoviral particles is only short lived and
completely degraded by day 3 post-infusion. In contrast, transduction with the 5.5kb
ΔAd.AAV1 (Figure 1) genome allows for long-term expression, suggesting that either
AAV ITRs can stabilize the viral genome as an episome until it is integrated or
integration occurs shortly after infection.

The status of vector DNA can be examined in K562 cells at different time points after
infection with ΔAd.AAVSNori, AdSNori, Ad.AAVSNoriITR, or rAAVSNori (MOI
2x10^5). Infected cells are harvested at 1 hour, 5 hours, 1 day, 3, 7, and 14 days after
infection and chromosomal DNA is analyzed by PFGE followed by hybridization with a
transgene specific probe. This technique allows us to distinguish between episomal
vector DNA, which appears as a distinct 5.0kb band, and integrated DNA. Furthermore,
extra chromosomal high-molecular weight vector concatemers can be detected. In the
case of random integration, after digestion of chromosomal DNA with I-CeuI or PI-SceI,
vector-specific signals in the range of 1-2 Mb should be seen. The intensity of episomal
and integrated vector signal is quantified for each time point using phosphoimager
analysis. This gives information about the kinetics of hybrid vector integration in a
population of infected K562 cells and the intracellular stability of hybrid vector genomes.
Structure of integrated vector DNA and integration junctions with chromosomal DNA: 

AAd.AAVl integrates as concatemer/s randomly into host DNA as shown previously. 

20 How many vector copies are present in one concatemer and whether the extent and the 
kinetics of tandem-formation are dose dependent still remain unclear. Another 
unanswered question is how AAd.AAV integrates: whether one or both ITRs are 
involved, whether the integrated ITRs are still intact, and whether adenoviral sequences 
integrate as well. These issues are important for the strategy to include rep genes into the 

25 hybrid vector genome. Moreover, if intact AAV ITRs are present within integrated 
vector copies, helper virus (adenovirus or HSV) infection in vivomay mobilize the 
integrated AAV-ITR vector cassette and affect stability of transgene expression. 

To answer these questions, K562 cells can be infected with ΔAd.AAVSNori, AdSNori,
Ad.AAVSNoriITR, and rAAVSNori at MOIs of 2x10^5, 2x10^6, or 2x10^7 genomes per cell.
Infected cells are plated in 96 well plates in the presence or the absence of G418. The
latter is included because G418 may cause amplification of integrated vector DNA
(Rutledge, E. A. et al., 1997, Journal of Virology, 71:8429-36). Genomic DNA from
isolated clones can be analyzed by regular Southern Blot as described in the Examples
Section to confirm the presence of vector concatemers and calculate the number of
integrated vector copies. More informative is the sequencing of integrated vector copies
and their junctions with chromosomal DNA. The structure of integration junctions can
be used to delineate the role of AAV ITRs in vector integration and the extent of
insertional mutagenesis after transduction. This data provides information about the
potential risks of hybrid vectors used in clinical trials.

Transduction of cell cycle arrested cells: 

The ultimate targets for the hybrid vectors described in the Specification are quiescent
hematopoietic stem cells. We hypothesize that the double-stranded nature of ΔAd.AAV
genomes and specific nuclear import mechanisms may allow for the transduction of
non-dividing cells. This is in part supported by the transduction studies with ΔAd.AAV1 in
quiescent hepatocytes in vivo. To confirm this data, primary fibroblasts can be forced to
enter the G0 phase by serum/growth factor starvation before infection with the hybrid
vectors according to a protocol described by Russell (Russell, D. et al., 1995, PNAS,
92:5719-23). Cells are maintained for three days after infection under serum/growth
factor deprivation. At this time point, genomic DNA is isolated and analyzed for
integration events by PFGE in comparison with growing cells. Another series of
integration studies can be carried out on K562 cells arrested in the G1/S phase of the cell
cycle with aphidicolin (added 1 day before and maintained several days after infection
with hybrid vectors, depending on the integration kinetics studies described earlier). To
investigate whether DNA damaging agents increase the transduction frequency of hybrid
vectors, cell-cycle-arrested K562 cells or primary fibroblasts can be treated with
cisplatinum or 3H-thymidine prior to virus infection according to a protocol described by
Alexander and Russell (Alexander, I. E. et al., 1994, J. Virol., 68:8282-87; Russell, D. et
al., 1995, PNAS, 92:5719-23). Furthermore, the effect of chromosomal DNA
decondensation on the transduction efficiency of hybrid vectors can be studied in arrested
cells after treatment with puromycin, staurosporine, Hoechst 3328, distamycin, or
vanadate.

F. Improvements in ΔAd.AAV Production and Purification

To inhibit packaging of full-length genomes, a modified form of I-SceI, a yeast
mitochondrial intron-endonuclease with a non-palindromic 18-bp recognition sequence, is
expressed in 293 cells. Constitutive expression of this enzyme in mammalian cells is not
toxic, possibly due to either the lack of I-SceI sites in the genome or sufficient repair of
them (Rouet, P. et al., 1994, PNAS, 91:6064-8). The yeast I-SceI is modified with an
SV40 T-antigen nuclear localization signal and an optimal Kozak sequence to enhance its
functionality in mammalian cells (Rouet, P. et al., 1994, PNAS, 91:6064-8). For another
yeast endonuclease it was shown that a recognition site within a transduced Ad genome
was efficiently cleaved (30% of all transduced genomes) when the enzyme was expressed in
human A549 cells. Importantly, E4 ORF6 and ORF3 expressed from the transduced Ad
genome inhibited repair of the double-strand breaks generated by the endonuclease
(Nicolas, A. L. et al., 2000, Virology, 266:211-24). This is consistent with the
observations by others that these E4 proteins prevent concatemerization of the viral
genome (Boyer, J. et al., 1999, Virology, 263:307-12). Based on this, packaging of
full-length virus containing an I-SceI recognition site is reduced in 293 cells
constitutively expressing I-SceI. The 18mer I-SceI site is inserted into the E3 region of
the Ad.IR vectors. These vectors are generated and amplified in 293 cells followed by a
large-scale infection of 293 cells expressing I-SceI. Alternatively, an expression cassette
for the endonuclease XhoI is inserted into the E3 region of Ad.IR or Ad.AAV vectors. The
XhoI gene will be modified for optimal function in mammalian cells. Vectors expressing
XhoI are generated and amplified in 293 cells expressing the XhoI isoschizomer PaeR7
methyltransferase (PMT) (Nelson, J. E. et al., 1997, J. Virol., 71:8902-7), which mediates
the addition of a methyl group onto the N6 position of the adenine base of XhoI sites,
CTCGAG. This protects the viral and cellular genome from XhoI cleavage. Methylated
Ad vectors are produced at high titers. ΔAd.AAV vectors are then obtained by large-scale
infection of 293 cells with the Ad.AAV-XhoI vectors. At this stage the viral
genome is not methylated and is digested at the XhoI sites. XhoI sites present within the
transgene cassette are deleted by site-directed mutagenesis without altering the amino
acid sequence. XhoI accumulates only at late stages in virus replication and should
act upon a large part of Ad DNA only when replication is completed. In addition,
ultracentrifugation optimizes the separation between ΔAd.IR and Ad.IR particles
(Balague, C. et al., 2000, Blood, 95:820-8).
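
The requirement that XhoI sites inside the transgene cassette be removed without changing
the encoded amino acids can be illustrated with a small design helper. This is a sketch
under stated assumptions, not the procedure used in the specification: it assumes an
in-frame coding sequence written in upper- or lower-case A/C/G/T, uses the standard genetic
code, and simply takes the first synonymous codon that destroys a CTCGAG site. The names
`translate` and `remove_xhoi_sites` are hypothetical, and a real design would also weigh
codon usage and regulatory elements.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons enumerated in TCAG order.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
XHOI_SITE = "CTCGAG"


def translate(cds):
    """Translate an in-frame coding sequence (incomplete trailing codon ignored)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def remove_xhoi_sites(cds):
    """Return a sequence encoding the same protein but lacking CTCGAG sites."""
    seq = cds.upper()
    pos = seq.find(XHOI_SITE)
    while pos != -1:
        fixed = False
        # Walk over every codon that overlaps the 6-bp site.
        for start in range(pos - pos % 3, pos + len(XHOI_SITE), 3):
            codon = seq[start:start + 3]
            if len(codon) < 3:
                continue
            for alt, aa in CODON_TABLE.items():
                if alt == codon or aa != CODON_TABLE[codon]:
                    continue
                candidate = seq[:start] + alt + seq[start + 3:]
                # Accept only changes that lower the total number of sites.
                if candidate.count(XHOI_SITE) < seq.count(XHOI_SITE):
                    seq, fixed = candidate, True
                    break
            if fixed:
                break
        if not fixed:
            raise ValueError(f"no silent change removes the site at position {pos}")
        pos = seq.find(XHOI_SITE)
    return seq
```

Because every accepted change must reduce the site count, the loop terminates, and
comparing `translate` output before and after provides a direct check that the protein
sequence is unchanged.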

EXAMPLE II 

MODIFIED FIBER PROTEIN 

A. Test the Infectivity of Different Human or Animal Serotypes on Human Bone
Marrow Cells.

Since the amino acid sequence of the fiber knob region varies considerably among the 
-50 known serotypes, it is thought that different adenovirus serotypes bind to different 
cellular receptor proteins or use different entry mechanisms (Shenk, T., 1996, In B. N. 
Fields, et al. (eds.), Fields Virology, vol. 2 Lippincott-Raven Publisher, Philadelphia; 
Mathias, P. et al., 1994, Journal of Virology, 68:6811-14; Defer, C., et al., 1990, J. of
Virology, 64:3661-3673). Although most adenoviruses contain RGD motifs in the
penton base proteins, there are a number of serotypes (e.g. Ad 40, 41) without this
conserved sequence. These types may use αv integrin-independent pathways for virus
internalization (Davison, A.J., et al., 1993, J Mol Biol, 234, 1308-1316; Mathias, P. et
al., 1994, Journal of Virology, 68:6811-14). To test whether other Ad serotypes can
infect the stem cell subpopulation present in human bone marrow, studies with a series of
different human Ad serotypes and animal viruses can be performed (see Table II). As a 
means to verify efficient transduction with Ad serotypes, viral DNA is tagged before 
infection and the presence of viral genomes in the nuclei of transduced cells is 
investigated. Furthermore, whether viral DNA is replicated in transduced cells can be 
analyzed as indirect proof for early viral gene expression. A direct detection of expressed 
viral proteins is impossible due to the unavailability of antibodies against all the 
serotypes included in this study. Simultaneously with the infection assay, transduced 
human bone marrow cells can be analyzed for morphological and immunohistochemical
features characteristic of HSC or progenitor subpopulations. For retargeting, serotypes 
which are able to infect CD34+ subsets of bone marrow cells at the lowest MOI are 
selected. As the next step, the fiber gene is PCR-cloned from serotypes with potential 
HSC/CD34+ tropism and inserted into standard shuttle plasmids for Ad5 vector 
generation replacing the Ad5 fiber gene using an E. coli recombination system (Figure 
9). 

METHODS 
Cells and viruses: 

HeLa (human cervix carcinoma, ATCC CCL-2.2), CHO (Chinese hamster ovary, ATCC 
CCL-61), K562 (human hematopoietic, ATCC 45506), HEp-2 (human larynx carcinoma, 
ATCC CCL-23), 293 (human embryonic kidney, Microbix, Toronto Canada) cells were 
maintained in DMEM, 10% FCS, 2 mM glutamine, and Pen/Strep. Culture media for 
CHO cells was supplemented with 200 µM asparagine and 200 µM proline. Human
CD34+-enriched bone marrow cells were purified from peripheral blood after
mobilization using MiniMACS VS+ separation columns (Miltenyi Biotec, Auburn, CA),
according to the manufacturer's instructions. Aliquots were stored in liquid nitrogen.
Sixteen hours before the experiment, cells were recovered from the frozen stock and
incubated overnight in IMDM media, supplemented with 20% FCS, 10^-4 M
β-mercaptoethanol, 100 µg/ml DNaseI, 2 mM glutamine, 10 U/ml IL-3, and 50 ng/ml stem
cell factor (SCF) or 2 ng/ml thrombopoietin (Tpo). The purity of CD34+ preparations 
was verified by flow cytometry and was consistently greater than 90%. 

Flow cytometry: 

Adherent cells (CHO, HeLa) grown in non-tissue culture treated 10 cm dishes (Falcon, 
Franklin Lakes, NJ) were detached by treatment with 1 mM EDTA and washed three
times with wash buffer (WB), consisting of PBS supplemented with 1% FCS. Cells
grown in suspension (K562, CD34+) were washed three times with WB. After washing,
cells were resuspended in WB at 2 x 10^6 cells/ml. Aliquots of 2 x 10^5 cells were incubated in WB for
1 h at 37°C with monoclonal antibodies specific for αv-integrins [L230, ATCC: HB-8448
(Rodriguez, E., Everitt, E. 1999. Arch. Virol. 144:787-795) (1/30 final dilution)],
CAR [RmcB (Bergelson, J. M., et al. 1997. Science. 275:1320-1323; Hsu, K.-H., L., et al.
1988. J. Virology. 62:1647-1652) (1/400 final dilution)], or BrdU [(Amersham, Arlington
Heights, IL) (1/100 final dilution)]. Subsequently, cells were washed with WB, and
incubated with fluorescein isothiocyanate (FITC)-labeled horse anti-mouse IgG
antibodies [(Vector Labs., Burlingame, CA) (1/100 final dilution)] or phycoerythrin (PE)-
labeled goat anti-mouse IgG antibodies [(Calbiochem, La Jolla, CA) (1/100 final dilution)] for
30 min at 4°C. After incubation with secondary antibodies, cells were washed two times
with WB and 10^4 cells per sample were analyzed in duplicate by flow cytometry.

For the analysis of CD34 and c-kit expression on transduced CD34+-cells and for 
fluorescent activated cell sorting (FACS), purified human CD34+ cells were incubated 
with phycoerythrin (PE)-conjugated anti-CD34 monoclonal antibodies (Becton-Dickinson
Immunocytochemistry Systems, San Jose, CA) or with PE-labeled anti-CD117 (c-kit) 
monoclonal antibodies (MAb 95C3, Immunotech, Beckman Coulter, Marseille, France) 
according to the manufacturer's protocol followed by flow cytometry analysis. All 
analyses and sortings were performed on a FACStar Plus flow cytometer (Becton 
Dickinson, Franklin Lakes, NJ) equipped with 488 nm argon and 633 nm HeNe lasers.
For analysis of c-kit expression and FACS purification of CD34+/c-kit+ cells, SCF was 
not added to the media during culturing of CD34+ cells. 

RESULTS 

CAR/αv-integrin expression on test cells:

It is generally accepted that CD34+ cells possess bone marrow repopulating activity. 
Therefore, we used human CD34+ cells as the target for our studies towards identifying 
Ad serotypes with HSC tropism and constructing new viral vectors. Studies were 
performed on mobilized, CD34-positive, peripheral blood cells from one donor under
conditions which are known to retain CD34+ cells in a quiescent stage (Leitner, A., et al.
1996. Br. J. Haematol. 92:255-262; Roberts, A. W., Metcalf, D. 1995. Blood. 86:1600-
1605). More than 90% of purified cells were CD34 positive by flow cytometry. 
Furthermore, we included into our Ad tropism studies the cell line K562, which is 
considered to be an adequate model for studying gene transfer into human hematopoietic 
cells (McGuckin, et al. 1996. British Journal of Haematology. 95:457-460). HeLa cells, 
which are readily infectible by Ad5, and CHO cells, which are refractory to Ad5 infection 
(Antoniou, M. et al., 1998, Nucleic Acid Res., 26:721-9), were used as positive and 
negative control cell lines, respectively. 

For Ad5, binding both to the primary receptor and to αvβ3 and αvβ5 integrins is
important for high efficiency infection of target cells. The expression of CAR and αv
integrins on test cells was analyzed by flow cytometry using monoclonal antibodies
against CAR (RmcB (Bergelson, J. M., et al. 1997. Science. 275:1320-1323; Hsu, K.-H.,
L., et al. 1988. J. Virology. 62:1647-1652)) and αv integrins (L230 (Roelvink, P. W., et
al. 1996. J. Virology. 70:7614-7621)) (Figure 10). As expected, nearly all HeLa cells
expressed high levels of CAR and αv-integrins, whereas CHO cells lacked significant
CAR and αv-integrin expression. Fifteen percent and 77% of K562 cells expressed CAR and
αv-integrins, respectively. Only ~6% of the CD34+ cells used in our studies expressed CAR
and 17% were positive for αv-integrins. Notably, the preparation of CD34+ cells
represents a mixture of different cell types. The absent or low expression of primary and
secondary Ad5 receptors on non-cycling human CD34+ cells is in agreement with
previous reports (Huang, S., et al. 1996. J. Virology. 70:4502-4508; Neering, S. J., et al.
1996. Blood. 88:1147-1155; Tomko, R. P., et al. 1997. Proc. Natl. Acad. Sci. USA.
94:3352-3356).

Infection assay using wild-type Ad5 and K562 cells:

The presence of viral DNA in the nucleus of infected cells is an indirect means to 
demonstrate efficient virus binding, internalization, and nuclear import. Nuclear 
localization of the viral genome is a prerequisite for transgene transcription and
integration. Two techniques are utilized to tag viral DNA for in situ analysis. To
optimize the infection assay, wild-type Ad5 virus and K562 cells which are permissive
for Ad5 infection can be used. The first protocol (Challberg, S.S. and Ketner, S. 1981,
Virology 114, 196-209) is based on 32P-labeling of viral DNA. During amplification of
wild-type Ad5 in A549 cells, 32P-phosphate (40 nCi/ml) is added to phosphate-free
medium. After development of CPE, 32P-tagged virus is harvested, banded in CsCl
gradients, and titered on HeLa cells according to standard protocols. To simulate the 
conditions for infection of human bone marrow cells, K562 cells are incubated in 
suspension with an MOI of 1, 10, or 100 of 32P-Ad5 for 2, 4, 6, or 8 hours under agitation
at 37°C. This covers the time period necessary for adsorption, internalization, and
nuclear import. After washing, cells are fixed and either transferred to microscopy slides
using cytospin or embedded in paraffin and sectioned (according to protocols from
VECTOR labs, Burlingame, CA). The latter has the potential advantage that multiple
consecutive sections (5 µm) of the same cell can be analyzed by different methods (e.g.
for 32P-tagged viral DNA, for specific histological staining, for immunofluorescence),
which allows for correlating infection with a particular cell type present in the bone 
marrow. Cells are incubated in a Kodak NTB-2 photo emulsion for autoradiography. 
The exposure time can be optimized to minimize background or non-nuclear localized 
signals. A dose and time dependent appearance of nuclear silver grains is expected under 
the optimized conditions. Since 32P-phosphate can label viral proteins as well, a
cytoplasmic background signal might appear. To facilitate detection, 
immunofluorescence with HSC specific antibodies on sections can be performed. As an 
alternative method, a BrdU-labeling technique for viral DNA can be used (Lieber, A., et 
al. 1999. J Virol 73:9314-24; Lieber, A. et al., 1996, Journal of Virology, 70:8944-60). 
In this case, different amounts of BrdU are added to the A549 culture medium during 
wtAd5 virus propagation. BrdU-labeled viral DNA can be detected with monoclonal
antibodies specific to BrdU. The signal can be enhanced using layers of species-specific 
polyclonal antibodies in combination with biotin/avidin and a fluorescent marker. BrdU 
tagged viral DNA can be detected on cytospins of bone marrow cells together with cell 
surface markers by double or triple immunofluorescence.

DISCUSSION

The interaction of selected Ad serotypes with CD34+ cells was tested. As a result of this 
screening we constructed a first-generation, Ad5-based vector whose fiber was 
substituted with the fiber derived from Ad35. We demonstrated that this capsid 
modification allowed for efficient viral transduction of potential HSCs by the 
corresponding chimeric Ad vectors. 

All tropism and transduction studies were performed with non-cycling CD34+ cells, 
which are thought to include HSCs. The quiescent stage of CD34+ cells purified from 
mobilized blood is important because induction of cell proliferation is associated with a 
loss of the ability to reconstitute hematopoiesis and with changes in the spectrum of 
cellular receptors (Becker, P. S., et al. 1999. Exp. Hematol. 27:533-541). It is known that
treatment of hematopoietic cells with cytokines or growth factors changes the expression
of specific integrins including αv-integrins, which would ultimately alter the
susceptibility of cells to Ad infection or may affect the viability of infected cells (Gonzalez,
R., et al. 1999. Gene Therapy. 6:314-320; Huang, S., et al. 1995. J. Virology. 69:2257-
2263). Another fact that complicates the interpretation of transduction studies is the
extraordinary heterogeneity of CD34+ cells with regard to morphology and function.

B. Screening different adenoviruses to establish tropism to HSC. 

The ATCC provides more than 70 different human or animal adenoviruses (see Appendix 
I). A collection of 15 human serotypes and 6 animal adenoviruses (see Table II) is
selected based on the following criteria: (i) availability of the complete genome sequence
or fiber sequence from the NIH gene bank, (ii) CAR receptor usage absent or unknown,
(iii) different subgroups, and (iv) moderate or low tumorigenicity (Shenk, T., 1996, In B. 
N. Fields, et al. (eds.), Fields Virology, vol. 2 Lippincott-Raven Publisher, Philadelphia). 
However, any serotype shown in the Appendix hereto can be used for the invention 
described. Animal viruses are included in the infectivity assay because this may provide a 
means to circumvent the pre-existing humoral immunity against human Ad5 fiber, which
represents a critical obstacle for clinical trials with Ad vectors. 

METHODS 

Viruses:

The following human adenovirus serotypes were purchased from the ATCC: 3 (VR-3), 4
(VR1081), 5 (VR-5), 9 (VR1086), 35 (VR-716) and 41 (VR-930). Adenovirus No. VR-716
was purchased from ATCC labeled as serotype 34; however, it was found to be
serotype 35 upon sequencing of the fiber region. For amplification, the corresponding
Ads were used to infect HeLa, 293, or HEp-2 cells under conditions that prevented
cross-contamination. Virus was banded in CsCl gradients, dialyzed and stored in aliquots as
described elsewhere (Lieber, A., C.-Y. et al. 1996. Journal of Virology. 70:8944-8960).

Plaque titering was performed as follows: Confluent 293 cells plated in 6-well plates
were incubated for 24 hours with virus in a total volume of 1 ml. Two weeks after
infection, plaques were counted on cultures overlaid with 1% agarose/MEM/10%
FCS.
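
A plaque count is converted to a titer by the usual arithmetic. The sketch below is only an illustration: it assumes the inoculum is a known dilution of the stock and that the whole 1 ml inoculum is applied to one well. The function name, the plaque count and the dilution are hypothetical and are not taken from the experiments.

```python
def titer_pfu_per_ml(plaques, dilution, inoculum_ml=1.0):
    """Plaque-forming units per ml of undiluted stock.

    plaques     -- plaques counted in one well
    dilution    -- dilution of the stock in the inoculum (e.g. 1e-8)
    inoculum_ml -- volume of diluted virus applied to the well
    """
    return plaques / (dilution * inoculum_ml)

# Hypothetical example: 42 plaques from 1 ml of a 10^-8 dilution.
example_titer = titer_pfu_per_ml(plaques=42, dilution=1e-8)
print(f"{example_titer:.1e} pfu/ml")
```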

EM studies:

CsCl-banded Ad stocks were thawed and diluted with 0.5% glutaraldehyde. Grids were
prepared as described earlier (Mittereder, N., et al. 1996. J. Virology. 70:7498-7509).
After staining with 2% methylamine tungstate (Nanoprobes, Stony Brook, NY), the
carbon-coated grids were evaluated and photomicrographed with a Philips 410 electron
microscope, operated at 80 kV (final magnification 85,000x). For each particular Ad
serotype, the number of morphologically deficient viral particles per 100 was counted in
five random fields.

RESULTS

Electron microscopy:

Little is known about the stability of particles from serotypes other than Ad5. Since the 
intactness of viral particles was crucial for comparative interaction studies, virions from 
the serotypes specified above were analyzed by electron microscopy (EM). EM studies of 
negative contrast stained Ad suspensions demonstrated that the percentage of defective 
particles (loss of icosahedral shape or luminal staining) did not exceed 5% indicating that 
serotype preparations had comparable qualities. Representative EM photographs are 
shown for Ads 5, 9, and 35 (Figure 11).

Serotype screening: 

It is thought that different Ad serotypes bind to different cellular receptor proteins and 
use different entry mechanisms (Defer, C., et al. 1990. J. Virology. 64:3661-3673;
Mathias, P., et al. 1994. Journal of Virology, 68:6811-6814). A set of human
adenoviruses was obtained from the ATCC to be tested for tropism to CD34+ cells.
These included serotypes 3, 4, 5, 9, 35, and 41 representing different subtypes (Table 1).
We believed that these serotypes would use different cellular attachment and
internalization strategies due to differing lengths of fiber shafts (Chroboczek, J., et al.
1995. Adenovirus fiber, p. 163-200. In W. Doerfler and P. Böhm (ed.), The molecular
repertoire of adenoviruses, vol. 1. Springer Verlag, Berlin; Roelvink, P. W., et al. 1998.
J. Virology. 72:7909-7915), the presence or absence of RGD motifs within the penton
base, and differing tissue tropism. The relatively little characterized Ad35 was selected
because it was found in immunocompromised hosts, particularly in bone marrow 
recipients (Flomenberg, P., et al. 1994. Journal of Infectious Diseases. 169:775-781; 
Flomenberg, P. R., et al. 1987. Journal of Infectious Diseases. 155:1127-1134; Shields, 
A. F., et al. 1985. New England Journal of Medicine. 312:529-533). The latter
observations prompted us to believe that bone marrow cells are among the natural 
reservoirs for Ad35. 

TABLE II

Human and animal adenoviruses with potential interest for the invention

Adenovirus         Serotype
Human/Group B      3, 7, 11, 16, 21, 34, 35
Human/Group D      8, 15, 17, 19, 28, 37
Human/Group F      40, 41
Avian              CELO, EDS
Bovine             3
Canine
Ovine              5
Swine              4
Mouse              1

The underlined serotypes use CAR independent pathways for cell entry. 



For amplification, the corresponding adenovirus stocks can be used to infect HeLa or
A549 cells such that at a given time only one virus type is handled in a separate laminar
flow hood and cultured in HEPA-filtered bottles, preferentially in separate CO2 incubators
to avoid cross-contamination. During propagation, viral DNA is tagged using one of the
techniques described earlier. Viral DNA can be isolated from purified particles. The
XhoI restriction pattern is analyzed for methylated and unmethylated viral DNA by
Southern blot using the full genome of the corresponding virus type as a radioactive 
probe. 

DISCUSSION 

Although it was reported earlier by slot-blot assay that fiber knobs derived from Ad2, 9, 4,
and 41L can bind to CAR (Roelvink, P. W., et al. 1998. J. Virology. 72:7909-7915), it is
not clear whether this binding occurs with an affinity that is physiologically relevant and
whether this would confer cell entry. Furthermore, as shown for the Ad5 interaction
between the penton and integrins, a secondary receptor is required to induce virus
internalization. We demonstrated that different serotypes interacted differently with the 
K562 or CD34+ target cells. Ad5, Ad4, and Ad41 were not able to efficiently attach to 
and be internalized by K562 and CD34+ cells. Although Ad4 belongs to a separate 
subgroup (E), it is thought that Ad4 represents a natural hybrid between subgroup B and
C viruses with a fiber related to Ad5 (Gruber, W. C., et al. 1993. Virology. 196:603-611).
Therefore, it was not surprising that Ad4 has binding properties similar to Ad5. The
subgroup F serotype Ad41 has been shown to contain two distinct fibers, a long-shafted and a
short-shafted fiber, allowing for different cell entry pathways (Tiemessen, C. T., Kidd,
A.H. 1995. J. Gen. Virol. 76:481-497). The Ad41 penton base does not contain RGD
motifs, suggesting that this virus may use αv-integrin-independent pathways for cell
entry. However, these features did not improve interaction with CD34+ cells. Ad9, Ad3, 
and Ad35 did interact with CD34+ cells more efficiently than Ad5. Out of all the 
serotypes tested, Ad35 demonstrated the most efficient attachment and internalization 
with K562 and CD34+ cells. Although the short-shafted Ad9 can bind to CAR, it 
preferentially uses αv-integrins for cell entry (Roelvink, P. W., et al. 1996. J. Virology.
70:7614-7621). Therefore, the low level of αv-integrin expression on certain subsets of
CD34+ cells may account for the observed susceptibility to Ad9. 

C. Attachment and Internalization of the Ad serotypes to K562 and CD34+ cells.

METHODS 

Labeling of Ads with [3H]-methyl thymidine:

Serotypes were labeled with [3H]-methyl thymidine as described in detail elsewhere
(Roelvink, P. W., et al. 1996. J. Virology. 70:7614-7621). Briefly, 5 x 10^7 HeLa or 293
cells were grown in 175 sq. cm flasks with 15 ml DMEM/10% FCS and infected with
wild type adenovirus at a MOI of 50 or higher. Twelve hours post-infection, 1 mCi of
[3H]-methyl thymidine (Amersham, Arlington Heights, IL) was added to the media and
cells were further incubated at 37°C until complete CPE was observed. Then, cells were
harvested, pelleted, washed once with cold PBS, and resuspended in 5 ml PBS. Virus was
released from the cells by four freeze-thaw cycles. Cell debris was removed by
centrifugation and viral material was subjected to ultracentrifugation in CsCl gradients
and subsequent dialysis as previously described (Lieber, A., C.-Y. et al. 1996. Journal of
Virology. 70:8944-8960). Virus purification and dialysis removed unincorporated
radioactivity. Wild type Ad particle concentrations were determined
spectrophotometrically by measuring the OD260, utilizing the extinction coefficient for
wild-type Ad5 ε260 = 9.09 x 10^-13 OD ml cm virion^-1 (Maizel, J. V., et al. 1968. Virology.
36:115-125). The virion specific radioactivity was measured by a liquid scintillation
counter and was always in the range of 1 x 10^-5 to 1 x 10^-4 cpm per virion. For selected
variants, the fiber gene was PCR amplified and sequenced to ensure identity and the
absence of cross-contamination.
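
The particle-concentration and specific-activity arithmetic described above can be sketched as follows. This is a minimal illustration that assumes a 1 cm path length; the function names and the example readings (OD260, dilution factor, cpm, counted volume) are hypothetical and are not data from these experiments.

```python
# Extinction coefficient for wild-type Ad5 quoted in the text:
# 9.09e-13 OD ml cm per virion, i.e. about 1.1e12 virions/ml per OD260 unit.
EPSILON_260 = 9.09e-13

def particles_per_ml(od260, dilution_factor=1.0, path_cm=1.0):
    """Virion concentration of the stock from an OD260 reading of a diluted sample."""
    return od260 * dilution_factor / (EPSILON_260 * path_cm)

def specific_activity(cpm, volume_ml, conc_particles_per_ml):
    """Virion-specific radioactivity in cpm per virion."""
    return cpm / (volume_ml * conc_particles_per_ml)

# Hypothetical readings, for illustration only.
conc = particles_per_ml(od260=0.25, dilution_factor=20)
activity = specific_activity(cpm=5.5e6, volume_ml=0.01, conc_particles_per_ml=conc)
print(f"{conc:.2e} particles/ml, {activity:.1e} cpm/virion")
```

With these made-up inputs the specific activity falls inside the 1 x 10^-5 to 1 x 10^-4 cpm per virion range reported in the text.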

Viral DNA tagged with methylase and test for replication by genomic Southern blots:

To ultimately confirm transduction, a protocol to detect adenoviral replication in infected
cells can be established. Viral DNA synthesis can only occur after de novo expression of
adenoviral early genes. A site-specific methylation strategy is utilized to monitor viral
DNA replication within infected cells (Nelson, J. et al., 1997, Journal of Virology,
71:8902-07). Methylation-marked adenovirus can be produced by the addition of a
methyl group onto the N6 position of the adenine base of XhoI sites, CTCGAG, by
propagation of the virus in HeLa or A549 cells expressing the XhoI isoschizomer PaeR7
methyltransferase (PMT) (Kwoh, T.J., et al., 1986, Proc. Natl. Acad. Sci. USA 83, 7713-
7717). It is known that methylation does not affect vector production but does prevent
cleavage by XhoI. Loss of methylation through viral replication restores XhoI cleavage
and can be detected by Southern blots of genomic DNA from infected cells in
comparison to native, non-methylated, viral genomes.
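
The logic of this methylation-based replication test can be illustrated with a toy digest. The sketch below assumes that XhoI cuts C^TCGAG on unmethylated DNA and is fully blocked on methylated DNA; the toy sequence and the function name are hypothetical, and real Southern blot patterns also depend on the probe and on any co-digesting enzyme.

```python
XHOI_SITE = "CTCGAG"
CUT_OFFSET = 1  # XhoI cuts between C and TCGAG on the top strand

def xhoi_fragments(seq, methylated):
    """Fragment lengths of a linear sequence after XhoI digestion.

    Methylated DNA (input virus grown in PMT-expressing cells) is protected;
    DNA replicated in cells lacking the methyltransferase is cut.
    """
    if methylated:
        return [len(seq)]
    cuts, start = [], 0
    while True:
        pos = seq.find(XHOI_SITE, start)
        if pos == -1:
            break
        cuts.append(pos + CUT_OFFSET)
        start = pos + 1
    edges = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(edges, edges[1:])]

# Hypothetical 142-nt "genome" carrying two XhoI sites.
toy_genome = "A" * 40 + XHOI_SITE + "G" * 60 + XHOI_SITE + "T" * 30
print(xhoi_fragments(toy_genome, methylated=True))
print(xhoi_fragments(toy_genome, methylated=False))
```

A single uncut band corresponds to input (non-replicated) genomes, whereas the appearance of the smaller fragments indicates that replication has removed the methylation mark.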

Attachment and internalization assays: 

These studies were performed based on a protocol published elsewhere (Wickham, T. J.,
et al. 1993. Cell 73:309-319). In preliminary experiments, we found that labeled Ad5
virions reached equilibrium in attachment to HeLa cells after 45 min at 4°C with an MOI
of 400 pfu per cell. For attachment studies, 3.5 x 10^5 cells were incubated for one hour on
ice with equal amounts of [3H]-labeled adenovirus OD particles equivalent to an MOI of
400 pfu/cell for Ad5 in 100 µl of ice-cold adhesion buffer (Dulbecco's modified Eagle's
medium supplemented with 2 mM MgCl2, 1% BSA, and 20 mM HEPES). Next, the cells
were pelleted by centrifugation for 4 min at 1000 x g and washed two times with 0.5 ml
ice-cold PBS. After the last wash, the cells were pelleted at 1500 x g, the supernatant was
removed, and the cell-associated radioactivity was determined by a scintillation counter.
The number of viral particles bound per cell was calculated using the virion specific
radioactivity and the number of cells. To determine the fraction of internalized [3H]-
labeled adenoviral particles, cells were incubated on ice for one hour with the
corresponding virus, washed with PBS as described above, resuspended in 100 µl
adhesion buffer, and then incubated at 37°C for 30 min. Following this incubation, cells
were diluted 3-fold with cold 0.05% trypsin-0.5 mM EDTA solution and incubated at
37°C for an additional 5-10 min. This treatment removed 99% of attached radioactivity.
Finally, the cells were pelleted at 1500 x g for 5 min, the supernatant was removed, and
the protease-resistant counts per minute were measured. This protocol minimizes the
possibility that the internalization data were affected by receptor recycling (Rodriguez,
E., Everitt, E. 1999. Arch. Virol. 144:787-795). Nonspecific binding of Ad particles to
cells on ice was determined in the presence of 100-fold excess of unlabeled virus. This
value routinely represented less than 0.1% of viral load.
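
The conversion from cell-associated radioactivity to particles per cell can be sketched as below. The cell number (3.5 x 10^5) comes from the protocol; the cpm readings and the specific activity are hypothetical values chosen only to show the calculation, and the function name is an assumption of this sketch.

```python
def particles_per_cell(cell_cpm, cpm_per_virion, n_cells, nonspecific_cpm=0.0):
    """Viral particles associated with each cell, from cell-bound radioactivity."""
    return (cell_cpm - nonspecific_cpm) / (cpm_per_virion * n_cells)

N_CELLS = 3.5e5        # cells per sample, as in the protocol
CPM_PER_VIRION = 5e-5  # hypothetical, within the reported 1e-5 to 1e-4 range

# Hypothetical scintillation counts: total cell-associated counts after the
# incubation on ice, and protease-resistant counts after the 37°C step.
attached = particles_per_cell(9625, CPM_PER_VIRION, N_CELLS)
internalized = particles_per_cell(8750, CPM_PER_VIRION, N_CELLS)
print(round(attached), round(internalized), round(internalized / attached, 2))
```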

RESULTS 

Attachment of Ad particles to target cells and internalization:

The selected serotypes were metabolically labeled with [3H]-thymidine, which is
incorporated into viral DNA during replication. Adsorption and internalization can be
experimentally dissociated by taking advantage of the observation that at low temperature
(0-4°C) only virus-cell attachment occurs, whereas internalization requires incubation at
higher temperatures. The number of particles adsorbed or internalized per cell was
calculated using the virion-specific radioactivity and used to quantify interaction of Ads
3, 4, 5, 9, 35, and 41 with CD34+, K562, HeLa and CHO cells (Figure 12). The serotypes
varied significantly in their ability to attach to and to be internalized by the different cell
lines. For Ad5, the degree of attachment to the cell lines tested correlated with the level
of CAR expression. In CHO cells, which were previously shown to be refractory to Ad5
infection, the level of attachment and internalization was about 50-70 viral particles per
cell. This number was hereafter assumed negative in terms of susceptibility of a given
cell type for Ad5. Interaction of the other serotypes with CHO cells was not significantly
higher, indicating that the corresponding receptor(s) were absent on CHO cells. All serotypes
tested interacted with HeLa cells, with Ad3 and Ad35 being the most efficient variants.
The presence of distinct Ad3 and Ad5 receptors on HeLa cells was demonstrated
previously (Stevenson, S. C., et al. 1995. J. Virology. 69:2850-2857). Ads 4, 5, and 41
did not bind to K562 cells. In contrast, Ad9 as well as the members of subgroup B, Ad3
and Ad35, efficiently interacted with K562 cells, with Ad35 having the highest number of
adsorbed and internalized particles. Compared to Ad5, about 25 times more Ad35
particles were attached and three-fourths of these were internalized by K562 cells. Viral
interactions with CD34+ cells were generally weaker. Among the serotypes tested, only
Ad9 and Ad35 were significantly internalized by non-cycling CD34+ cells.
Internalization of Ad9 and Ad35 was, respectively, four and eight times more efficient
than for Ad5 particles. The number of Ad35 virions internalized by CD34+ cells was
almost half of that seen for Ad5 in HeLa cells, which can be readily infected with Ad5-
based vectors.

Attachment and internalization of adenovirus serotypes 3, 5, 9, 35 and 41 into HeLa, 293,
and CHO cells:

HeLa and 293 cells, which express high levels of primary and secondary receptors for human
adenoviruses, are used as a positive control for virus attachment and internalization. As a
negative control, CHO cells are used. CHO cells do not express the primary adenoviral
receptor at a detectable level, and are therefore refractory to adenoviral infection. For
attachment studies, these adherent cell lines are detached from 10 cm dishes with PBS-
EDTA solution (without Ca2+ and Mg2+), washed three times with ice-cold PBS,
resuspended in adhesion buffer, and incubated with viruses as described above in the
Examples section. As expected, all adenoviral serotypes tested are efficiently attached to
and internalized into HeLa cells (Table III) (Figure 13). Adenovirus serotypes 3, 5, 35,
and 41, but not 9, are efficiently attached to and internalized by 293 cells. In contrast, poor
attachment and internalization of most adenovirus serotypes are observed with CHO
cells. The level of attachment on CHO is about 50-70 virus particles per cell for
adenovirus serotypes 5 and 41, 115 virus particles per cell for adenovirus type 3 and
about 180 particles per cell for adenovirus serotypes 9 and 35. For further analysis,
numbers >300 viral particles per cell are assumed as positive and <70 viral particles per
cell as negative in terms of susceptibility of a particular cell line for efficient adenoviral
transduction.
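
The cut-offs stated above can be written as a small classifier. In this sketch the thresholds (>300 positive, <70 negative) come from the text; the label "intermediate" for values in between is an assumption of the sketch, and the CHO values are the approximate figures quoted above, with 60 used as a stand-in for the "about 50-70" range.

```python
POSITIVE_ABOVE = 300  # particles per cell, threshold stated in the text
NEGATIVE_BELOW = 70   # particles per cell, threshold stated in the text

def classify(particles_per_cell):
    """Classify susceptibility from the number of attached particles per cell."""
    if particles_per_cell > POSITIVE_ABOVE:
        return "positive"
    if particles_per_cell < NEGATIVE_BELOW:
        return "negative"
    return "intermediate"  # label assumed here; the text leaves this range undefined

# Approximate CHO attachment values quoted in the text (particles per cell).
cho_attachment = {"Ad3": 115, "Ad5": 60, "Ad9": 180, "Ad35": 180, "Ad41": 60}
for serotype, n in cho_attachment.items():
    print(serotype, n, classify(n))
```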



TABLE III

Comparative analysis of attachment and internalization of Ad5 and Ad9
to cell lines expressing different amounts of CAR and αvβ-integrins.

Cell line      CAR          αvβ-integrin   Ad9 (attached/   Ad5 (attached/
               expression   expression     internalized)    internalized)
HeLa           ++           ++             426/370          550/500
CHO                         ++             300/300          70/50
293            ++           ++             20/20            1950/1750
Y79            +++                         190/140          1200/1100
K562                        +              320/230          60/50
Erythrocytes   ?            ?              420/-            68/-

Attachment and internalization of adenovirus serotypes 3, 5, 9, 35 and 41 into human
CD34+ bone marrow cells and K562 erythroleukemia cell line:

Previous studies showed that the human erythroleukemia cell line K562 can be
transduced with Ad5-based adenoviral vectors at very high MOIs. As shown in Figure
14, only about 60 viral particles per cell of adenovirus serotype 5 are attached to and even
fewer particles are internalized into these cells at a MOI of 400. In contrast to Ad5, about
320 viral particles per cell of Ad9 and about 1500 viral particles per cell of Ad35 are
attached to and about two-thirds of them are internalized into K562 cells (Figure 14B).
Human unstimulated CD34+-enriched bone marrow cells obtained from frozen stocks are
incubated overnight in growth medium without cytokine stimulation. The next day, the
number of viable cells is calculated. For attachment studies, cells are washed three times
with ice-cold PBS, resuspended in adhesion buffer and incubated with adenoviruses.
Among the adenoviral serotypes tested, only particles of Ad9 (about 150 viral
particles per cell) and Ad35 (about 320 viral particles per cell) attach to
unstimulated CD34+ cells at a higher level than Ad5 (only 60 viral particles per
cell). Four-fifths of these virus particles are internalized by the cells.
Interestingly, upon stimulation of CD34+ cells with GM-CSF and EPO/TPO for two
weeks, attachment and internalization of Ad9 viral particles are significantly increased
(up to 300 particles per cell). In contrast, transient stimulation of cells with
GM-CSF for two days does not increase the level of viral attachment to the cells.

Based on the above finding that the Ad35 serotype attaches to and is internalized into
CD34+ cells most efficiently among the serotypes tested, serotype Ad35 was selected
for further studies. As described in Appendix II, a chimeric vector (Ad5 GFP/F35)
containing the short-shafted Ad35 fiber sequence in an Ad5 capsid was able to target a
broad spectrum of CD34+ cells in a CAR/integrin-independent manner.

DISCUSSION 

In summary, of all the serotypes tested, Ad9, Ad3, and Ad35 demonstrated the most
efficient attachment to and internalization by K562 and CD34+ cells. Based on
adsorption/internalization data, Ad9 and Ad35 as representatives for subgroups D and B
were selected for further tropism studies.

D. Characterization of Ad vector replication in K562 and CD34+ cells.

Comparative analysis of the ability of Ad5, Ad9 and Ad35 to infect and to replicate in 293, K562
and CD34+ cells. The ability of the Ad9 fiber knob domain to recognize the same
primary receptor on the cell surface as Ad5 with comparable affinity was described
earlier. Thus, the finding that Ad9 viral particles can only poorly attach to 293 cells is
rather unexpected. In order to find out how the attachment and internalization data
reflect the biological activity of adenoviruses of different serotypes, the stocks of Ad5,
Ad9 and Ad35 are characterized in more detail by electron microscopy, plaque assay on
293 cells, and quantitative replication assay in K562 and CD34+ cells.

METHODS 
10 Quantitative replication assay: 

1 x 10^5 CD34+ or K562 cells were infected in 100 µl of growth media with different MOIs
of Ad5, 9, or 35 which had been amplified in 293 cells expressing the Xho I DNA
methyltransferase isoschizomer PaeR7 (Nelson, J., Kay, M.A. 1997. Journal of Virology.
71:8902-8907). After 2 hours of incubation at 37°C, the cells were centrifuged at 1000 x
g for 5 min, the virus-containing medium was removed, the cells were resuspended in
100 µl of fresh media, and then they were incubated at 37°C until harvesting. At 16 hours
post-infection for K562 cells, or 36 hours post-infection for CD34+ cells, 5 µg of pBS
(Stratagene, La Jolla, CA) plasmid DNA was added as a carrier which could also be used
as a loading control. Genomic DNA was extracted as described previously (Lieber, A.,
C.-Y. et al. 1996. Journal of Virology. 70:8944-8960). One-fourth of the purified cellular
DNA (equivalent to 2.5 x 10^4 cells) was digested with Hind III, Xho I, or with Hind III and
Xho I together at 37°C overnight and subsequently separated in a 1% agarose gel followed
by Southern blot with chimeric Ad5/9 or Ad5/35 DNA probes. The chimeric probes,
containing sequences of Ad5 and Ad9 (Ad5/9) or Ad5 and Ad35 (Ad5/35), were
generated by a two-step PCR amplification using Pfu-Turbo DNA polymerase
(Stratagene, La Jolla, CA) and viral DNA from purified particles as a template. The
following primers were used for PCR (for the chimeric primers, the first nucleotide
range refers to Ad5 and the second to Ad9 or Ad35):
Ad5F1 - (nt: 32775-32805) 5'- GCC CAA GAA TAA AGA ATC GTT TGT GTT ATG -3';
Ad5R1 - (nt: 33651-33621) 5'- AGC TGG TCT AGA ATG GTG GTG GAT GGC GCC A -3';
chimeric Ad5/9F - (nt: 31150-31177, nt: 181-208) 5'- AAT GGG TTT CAA GAG AGT CCC CCT
GGA GTC CTG TCA CTC AAA CTA GCT GAC CCA -3';
chimeric Ad5/9R - (nt: 32805-32775, nt: 1149-1113) 5'- CAT AAC ACA AAC GAT TCT TTA TTC
TTG GGC TTC ATT CTT GGG CGA TAT AGG AAA AGG -3';
chimeric Ad5/35F - (nt: 31150-31177, nt: 132-159) 5'- AAT GGG TTT CAA GAG AGT CCC CCT
GGA GTT CTT ACT TTA AAA TGT TTA ACC CCA -3';
chimeric Ad5/35R - (nt: 32805-32775, nt: 991-958) 5'- CAT AAC ACA AAC GAT TCT TTA TTC
TTG GGC ATT TTA GTT GTC GTC TTC TGT AAT GTA AG -3'.
Nucleotide numbers are given according to the sequences obtained from the NCBI
GenBank (accession No. M73260 / M29978 for Ad5, X74659 for Ad9, and U10272 for
Ad35). After the first amplification, the 968 bp-long Ad9 and 859 bp-long Ad35 DNA
fragments corresponding to the fiber genes, and an 876 bp-long Ad5 fragment
corresponding to the Ad5 E4 region (located immediately downstream of the Ad5 fiber gene)
were purified by agarose gel electrophoresis. To generate chimeric DNA probes,
amplified Ad5 DNA was mixed with Ad9 or Ad35 fragments obtained during the first
step of PCR and subjected to a second PCR amplification using Ad5/9F or Ad5/35F
primers and the Ad5R1 primer. The resulting Ad5/9 or Ad5/35 chimeric DNA fragments
(see Fig. 15) were purified and their concentrations were measured
spectrophotometrically. Corresponding chimeric DNA fragments were loaded as
concentration standards on agarose gels or labeled with [32P]-dCTP and used as probes
for Southern analysis. The number of viral genomes per DNA sample was calculated
after quantitative Phospho-imager analysis. In preliminary experiments, no preferential
hybridization of chimeric DNA probes to DNA of any particular viral serotype was
detected.
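
The two-step PCR depends on sequence overlap between the first-round products. As a minimal sketch, the check below tests whether the 5' portion of each chimeric reverse primer is the reverse complement of the Ad5 forward primer, which would allow the Ad5 E4 fragment to anneal to the Ad9 or Ad35 fiber fragment in the second round. The primer sequences are copied from the list above; the function names are illustrative and not part of the described protocol.

```python
# Sketch: check the overlap that joins first-round PCR products.
# Sequences are taken from the primer list; helper names are hypothetical.

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def clean(primer: str) -> str:
    """Remove the triplet spacing used in the text and upper-case the sequence."""
    return primer.replace(" ", "").upper()


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


AD5_FORWARD = clean("GCC CAA GAA TAA AGA ATC GTT TGT GTT ATG")
AD5_9_REVERSE = clean(
    "CAT AAC ACA AAC GAT TCT TTA TTC TTG GGC TTC ATT CTT GGG CGA TAT AGG AAA AGG"
)
AD5_35_REVERSE = clean(
    "CAT AAC ACA AAC GAT TCT TTA TTC TTG GGC ATT TTA GTT GTC GTC TTC TGT AAT GTA AG"
)


def shares_ad5_overlap(chimeric_reverse: str, ad5_forward: str = AD5_FORWARD) -> bool:
    """True if the chimeric reverse primer starts with the reverse complement
    of the Ad5 forward primer, i.e. the two first-round products can overlap."""
    return chimeric_reverse.startswith(reverse_complement(ad5_forward))


for name, primer in (("Ad5/9R", AD5_9_REVERSE), ("Ad5/35R", AD5_35_REVERSE)):
    print(name, "overlaps Ad5 forward primer:", shares_ad5_overlap(primer))
```

Both chimeric reverse primers pass this check, consistent with their use as bridging primers between the fiber and E4 fragments.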

RESULTS

Replication of selected serotypes in K562 and CD34+ cells: 

Adsorption/internalization studies do not ultimately prove viral transduction, a process
often defined as gene transfer that allows for viral or heterologous gene expression in
host cells. Intracellular trafficking, including endosomal lysis, transport to the nucleus,
and nuclear import of the viral genome, depends on structural capsid proteins and thus
varies between different serotypes (Defer, C., et al. 1990. J. Virology. 64:3661-3673;
Miyazawa, et al. 1999. J. Virology. 73:6056-6065). We believed that analysis of viral
gene expression would be a means to verify successful nuclear import of viral genomes
and that this would be a good criterion for selection of serotype/s able to efficiently infect
our target cells. To do this, we used a protocol that allows for the detection of Ad
replication in infected cells. Viral DNA synthesis can only occur after de novo expression
of adenoviral early genes. We utilized a site-specific methylation strategy to monitor viral
DNA replication within infected cells (Nelson, J., Kay, M.A. 1997. Journal of Virology.
71:8902-8907). Methylated Ad serotypes were produced by the addition of a methyl
group onto the N6 position of the adenine base of Xho I sites, CTCGAG, during
propagation of the viruses in 293 cells expressing the Xho I isoschizomer PaeR7
methyltransferase (PMT) (Kwoh, T. J., et al. 1986. Proc. Natl. Acad. Sci. USA. 83:7713-
7717) (293 PMT cells). Methylation blocks Xho I cleavage; loss of methylation through
viral replication restores Xho I cleavage and can be detected by Southern blots of
Xho I-digested genomic DNA from infected cells.
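
The logic of the methylation assay can be illustrated with a toy model: methylated Xho I sites are protected from cleavage, so input genomes and replicated genomes give different fragment patterns. The sketch below is an illustration only; the sequence is invented, the function names are hypothetical, and the model ignores hemimethylated intermediates.

```python
# Toy model of Xho I digestion with methylation-protected sites.
# Xho I recognizes CTCGAG and cuts after the first C (C^TCGAG).

XHOI_SITE = "CTCGAG"


def find_sites(seq: str, site: str = XHOI_SITE) -> list:
    """Return the 0-based start position of every occurrence of the site."""
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions


def digest(seq: str, methylated_sites=frozenset()) -> list:
    """Return the fragments produced by cutting every unmethylated site.

    methylated_sites holds the start positions of protected sites.
    """
    fragments = []
    last_cut = 0
    for pos in find_sites(seq):
        if pos in methylated_sites:
            continue  # a methylated site is not cleaved
        cut = pos + 1  # cut between C and TCGAG
        fragments.append(seq[last_cut:cut])
        last_cut = cut
    fragments.append(seq[last_cut:])
    return fragments


toy_genome = "AAAACTCGAGTTTTCTCGAGGGGG"  # invented sequence with two sites
sites = find_sites(toy_genome)
print("input (methylated):", [len(f) for f in digest(toy_genome, set(sites))])
print("replicated (unmethylated):", [len(f) for f in digest(toy_genome)])
```

In the actual assay the same distinction is read out as different band sizes on the Southern blot rather than as fragment lists.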

Ad replication studies were performed in K562 and CD34+ cells with Ad9 and Ad35, in
comparison to Ad5. For replication studies, the infectious titer (in pfu/ml) and genome
titer (in genomes per ml) were determined (by plaque assay on 293 cells or by
quantitative Southern blot, respectively) for methylated and unmethylated Ad5, Ad9, and
Ad35 (Table 2). The ratio of pfu to genome titer was comparable for methylated and
unmethylated virus, demonstrating that DNA methylation had not altered transduction
properties. About 85% of the (Ad5, 9, and 35) virus used for infection was methylated, as
calculated based on the intensity of fragments specific for methylated and non-methylated
viral DNA present in the viral load (Fig. 15). The numbers of genomes
detected after adsorption (1 hour, 0°C) or internalization (2 hours, 37°C) correlated well
with studies shown in Fig. 12. Ad9 and Ad35 interacted more efficiently than Ad5 with
K562 and CD34+ cells. Dose-dependent replication studies in K562 and CD34+ cells
were performed with the same genome numbers of Ad5, 9, and 35 (Fig. 15). The
replication rate was measured based on the ratio of methylated to demethylated viral
DNA after infection with different MOIs (2100, 420, and 105 genomes per cell). In K562
cells, efficient replication (100% conversion from methylated to unmethylated DNA) was
detected for Ad5 at MOI >/=2100, for Ad9 at MOI >/=420, and for Ad35 at MOI
>/=105. This demonstrated that Ad35 transduced K562 cells with the highest efficiency.
In CD34+ cells, the replication rate was 100% for Ad5 and 31% for Ad9 after infection
with MOI 420. Although methylated Ad35 viral DNA was present in CD34+ cells, viral
replication was undetectable for Ad35. In summary, while viral replication studies in
K562 cells confirmed data obtained for Ad5, 9, and 35 adsorption and internalization,
there was a discrepancy between earlier results and the poor replication of Ad9 and,
particularly, Ad35 in CD34+ cells. As outlined later, replication analysis in
heterogeneous cell populations, like CD34+ cells, may not allow for definitive
conclusions on tropism of a particular serotype.
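
As a worked illustration of how a replication rate could be derived from band intensities, the sketch below computes the percent conversion from methylated to unmethylated viral DNA. The baseline correction for unmethylated genomes already present in the inoculum (about 15%, given that about 85% of the input was methylated) is an assumption added for illustration; the text does not state how, or whether, such a correction was applied. Function names and example intensities are hypothetical.

```python
# Sketch: percent conversion of methylated to unmethylated viral DNA
# from two band intensities (arbitrary phosphorimager units).


def unmethylated_fraction(methylated_signal: float, unmethylated_signal: float) -> float:
    """Fraction of the total viral signal found in the unmethylated band."""
    total = methylated_signal + unmethylated_signal
    if total <= 0:
        raise ValueError("no viral DNA signal")
    return unmethylated_signal / total


def replication_percent(methylated_signal: float,
                        unmethylated_signal: float,
                        input_methylated_fraction: float = 0.85) -> float:
    """Percent conversion, corrected for unmethylated DNA in the inoculum.

    The correction rescales the observed fraction so that the input virus
    alone scores 0% and fully unmethylated DNA scores 100% (an assumption).
    """
    if not 0 < input_methylated_fraction <= 1:
        raise ValueError("input_methylated_fraction must be in (0, 1]")
    baseline = 1.0 - input_methylated_fraction
    observed = unmethylated_fraction(methylated_signal, unmethylated_signal)
    corrected = (observed - baseline) / (1.0 - baseline)
    return 100.0 * min(1.0, max(0.0, corrected))


# Hypothetical intensities: all signal unmethylated, input-like, and mixed.
for meth, unmeth in ((0, 50), (85, 15), (50, 50)):
    print(meth, unmeth, round(replication_percent(meth, unmeth), 1))
```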

Taking all the screening data together, Ad9 and Ad35 emerged as the variants with the
strongest tropism for K562 and CD34+ cells. It is thought that Ad9 can bind to CAR;
however, it preferentially uses αv-integrins for cell entry (Roelvink, P. W., et al. 1996. J.
Virology. 70:7614-7621). This entry strategy may not be optimal for efficient infection of
CD34+ cells, as fewer than 17% of them express αv-integrins (Fig. 10). Therefore, we
decided to concentrate on Ad35 as a source for the heterologous fiber to be used for
construction of a chimeric vector based on an Ad5 backbone.

TABLE IV

Results from the infectivity assay which determines the optical
particle-to-PFU (OPU/PFU) ratio using 293 cells

Virus    OPU (A260)       PFU             OPU/PFU ratio
Ad5      1.4 x 10^12      1.06 x 10^11    13
Ad9      4.61 x 10^11     2.6 x 10^8      1773
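
The OPU/PFU ratio in Table IV is a simple quotient of the optical particle titer and the plaque-forming titer. The sketch below reproduces the arithmetic; the titer values are entered with exponents that reproduce the tabulated ratios of 13 and 1773 and should be treated as assumptions, and the function name is illustrative.

```python
# Sketch: optical particle-to-PFU ratio from two titers.


def particle_to_pfu_ratio(opu: float, pfu: float) -> float:
    """Return the OPU/PFU ratio; both titers must use the same volume basis."""
    if pfu <= 0:
        raise ValueError("PFU titer must be positive")
    return opu / pfu


# Assumed titer readings (exponents chosen to match the tabulated ratios).
titers = {
    "Ad5": (1.4e12, 1.06e11),
    "Ad9": (4.61e11, 2.6e8),
}

for virus, (opu, pfu) in titers.items():
    print(virus, round(particle_to_pfu_ratio(opu, pfu)))
```

A higher ratio means that a larger share of physical particles fails to form plaques on 293 cells.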



DISCUSSION

Viral replication studies in K562 cells confirmed the data obtained for Ad5, 9, and 35
adsorption and internalization. However, there was a discrepancy between the interaction
data and the replication data in CD34+ cells, where Ad9 replicated only poorly and no
replication was seen for Ad35. Ad replication is only initiated upon the production of a
critical threshold of early viral proteins, which in turn is directly dependent on the
number of viral genomes present in the nuclei of infected cells. Therefore, the outcome of
replication studies may be affected by the rate of nuclear import of viral genomes, by the
activity of viral promoters, and/or the intracellular stability of viral DNA/RNA. These
parameters may vary, on one hand, between different subsets of CD34+ cells, and/or, on the
other hand, between different Ad serotypes. In conclusion, the viral replication analyses
performed with different Ad serotypes in CD34+ cells may not predict the actual
transduction properties of chimeric vectors based on an Ad5 backbone. This implies that
attempts to produce gene transfer vectors based on Ad genomes other than Ad5 should be
undertaken with caution.


Recently, an Ad serotype screening strategy was used to identify variants with tropism
for primary fetal rat CNS cortex cells or human umbilical vein endothelial cells. The
optimal serotype (Ad17) was selected based on immunohistochemistry for hexon
production 48 hours after infection (Chillon, M., et al. 1999. J. Virology. 73:2537-2540).
However, this approach is problematic because, at least in our hands, antibodies
developed against Ad5 hexon did not cross-react with other serotypes. Also, hexon is
expressed only after onset of replication. As outlined above, the kinetics of intracellular
trafficking, viral gene expression, and replication vary significantly between serotypes
(Defer, C., et al. 1990. J. Virology. 64:3661-3673; Miyazawa, et al. 1999. J. Virology.
73:6056-6065).

In addition to being the most efficient serotype in terms of interaction with CD34+ cells,
Ad35 is also interesting because it interacts with receptor/s different from those of Ad5 and
Ad3. Ad35 and Ad5GFP/F35 attachment was not inhibited by Ad5 or anti-CAR
antibodies, suggesting that Ad35 binding was CAR independent. First, Ad5 did not
compete with Ad35 and Ad5GFP/F35 during internalization and infection, indicating that
αvβ3/5-integrins are not involved in viral entry. Second, function-blocking antibodies
against αv-integrins did not compete with Ad35 and Ad5GFP/F35 for internalization into
K562 cells, whereas these antibodies did inhibit Ad5 internalization. And third, in
contrast to Ad5 based vectors, GFP expression after infection with Ad5GFP/F35 was not
restricted to αv-integrin-expressing CD34+ cells. From these facts, we conclude that
infection with Ad35 and the chimeric Ad5GFP/F35 vector does not involve αv-integrins.
In this context, the presence or absence of RGD motifs within the Ad35 penton base remains
to be determined by sequencing the corresponding genome region. Cross-competition
assays demonstrated that Ad35 and Ad5GFP/F35 bind to a receptor that is different from
the Ad3 receptor. Although Ad3 and 35 belong to the same subgroup, they have been
divided into two DNA homology clusters, B1 and B2; the amino acids composing their
fibers are only 60% homologous. Furthermore, the target tissues for both viruses are
different; Ad3 can cause acute respiratory infections, whereas Ad35 is associated with
kidney infection (Horwitz, M. S. 1996. Adenoviruses, p. 2149-2171. In B. N. Fields,
Knipe, D.M., Howley, P.M. (ed.), Virology, vol. 2. Lippincott-Raven Publishers Inc.,
Philadelphia). Therefore, it was not surprising to see that Ad3 and Ad35 recognize
different receptors.

In conclusion, Ad35 and the chimeric vector enter the cells by a CAR- and αv-integrin
independent pathway. We believe that Ad35 and the chimeric vector bind primarily to
their fiber receptor and that this interaction is sufficient to trigger internalization. On the
other hand, Ad35 internalization may involve cellular proteins other than αv-integrins.
These membrane proteins can overlap with those for Ad3 internalization and represent β2
integrins, which protrude more from the cell surface than αv-integrins (Huang, S., et al.
1996. J. Virology. 70:4502-4508).

According to EM studies of negative contrast-stained adenoviral suspensions, the
percentage of deficient particles for all adenoviral serotypes tested does not exceed 5%.
However, plaque assays reveal that the ability to form plaques in 293 cells differs
significantly among the tested serotypes. The optical particle-to-PFU (OPU/PFU) ratio
obtained is 13 for Ad5, which is in good agreement with the previously estimated ratio for
this adenoviral serotype. Importantly, this ratio is about three times higher for adenovirus
serotype 35 and more than 150-fold higher for adenovirus serotype 9. Furthermore,
quantitative Southern blot using chimeric Ad5/9 and Ad5/35 DNA probes is used to
determine the ratio between the genome and transducing titer. This study confirms the
data obtained by plaque assay. Quantitative replication assay of these adenoviruses in
K562 and CD34+ cells also confirms the ability of Ad9 and Ad34 to more efficiently
attach to these cell types. The replication of viral genomes is observed for Ad9 and Ad34
at lower MOIs of infection, compared to Ad5. In conclusion, the data obtained for
different serotypes in attachment and internalization are in good agreement with the
infectivity data in target cells.

E. Attachment and internalization of different adenoviral serotypes into primary
dendritic cells, JAWSII, MCF-7 and REVC cells.

As a proof of principle, the serotype screening strategy can be employed for other
important target cells which are refractory to Ad5 infection.



REVC cells are endothelial cells which have to be targeted in approaches aimed
at gene therapy of restenosis, atherosclerosis, inflammation etc. MCF-7 cells are breast
cancer cells isolated from liver metastases which are important targets for tumor gene
therapy. The human adenovirus serotypes 3, 5, 9, 35 and 41 are tested to see whether
they can attach to and be internalized by mouse primary dendritic cells, JAWSII
cells, MCF-7 human breast cancer cells and REVC endothelial cells. None of the
adenoviral serotypes tested can efficiently attach to primary dendritic cells. Adenovirus
serotype 3 is able to efficiently attach to REVC endothelial cells (about 400 virus
particles per cell are attached and about 300 are internalized). In comparison, only 50
Ad5 particles are able to attach to, and even fewer are internalized by, these REVC cells. The
human breast cancer cells (MCF-7) were previously shown to be refractory to Ad5
infection at low MOIs. However, Ad3 and, more efficiently, Ad35 attach to and
internalize into MCF-7 cells.

DISCUSSION 


The data presented herein indicate that different human adenovirus serotypes recognize
different cellular receptors and can therefore infect cell types that are refractory to Ad5
infection. There are adenoviral serotypes that attach to and internalize into human CD34+
cells, REVC, K562 and MCF-7 cells more efficiently than Ad5. This finding provides
a basis for the construction of chimeric adenoviral vectors which are Ad5 vectors
containing receptor ligands derived from other serotypes.

F. Infection studies on primary human bone marrow cells. 

Since established erythroleukemic cell lines do not represent an adequate model for the
ultimate hematopoietic stem cell that has to be targeted in patients in order to achieve
long-term reconstitution with genetically modified cells, normal primary human bone
marrow cells are used for the initial infection/retargeting studies.

RESULTS

In a first set of tropism studies with different Ad serotypes, whole bone marrow cell
suspensions can be used without preselection. This is advantageous because the tropism
of various adenovirus serotypes or genetically retargeted vectors can be analyzed on a
broad spectrum of progenitor subpopulations representing myeloid, erythroid,
megakaryocyte, lymphoid, dendritic, and monocytic lineages. For short term (< 5 hours)
infection studies, bone marrow suspensions can be cultured in IMDM supplemented with
10% FCS, β-mercaptoethanol, and 10 u/ml IL-3 for ensuring cell viability.

Mononucleated cell assays:

Mononucleated bone marrow cells can be incubated with MOI 1, 10, 100, or 1000
pfu/cell of the various adenovirus types for a short time. Paraffin sections or cytospins of
infected bone marrow cells can be analyzed for nuclear-localized, labeled viral DNA.
BrdU labeling can be visualized by immunofluorescence with anti-BrdU antibodies;
32P-tagged viral DNA can be detected by incubation with photo-emulsion. In addition, the
same cell material can be analyzed for morphology after specific histo-staining (e.g.
Wright-, Hemo3 staining). If required, commercially available antibodies to
specific cell surface markers, conjugated directly to different fluorochromes (FITC
(green), TRITC, RPE (red), RPE-Cy5, AMCA (blue)), can be used to completely
characterize infected bone marrow subpopulations. Colocalization of BrdU-labeled viral
DNA (e.g. as FITC signal) with membrane markers signifying infection of specific cell
types can be demonstrated; for example, potential stem cells/early progenitors (CD34+,
CD38-), megakaryocytes (CD41a+), erythroid cells (glycophorin A+), dendritic cells
(CD1a+), monocytes (CD14+), or myeloid cells (CD15+), etc. The morphological analysis
of infected bone marrow subsets gives a first indication of whether specific adenovirus
serotypes can target primitive cell types.
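
For the MOI series mentioned above, the volume of virus stock per sample follows directly from the MOI, the number of cells, and the stock titer. The sketch below shows the arithmetic; the cell number and stock titer are hypothetical placeholders, since neither is specified for this assay, and the function name is illustrative.

```python
# Sketch: volume of virus stock needed to reach a given MOI.


def stock_volume_ul(moi: float, n_cells: float, titer_pfu_per_ml: float) -> float:
    """Microliters of stock that deliver moi * n_cells pfu."""
    if titer_pfu_per_ml <= 0:
        raise ValueError("titer must be positive")
    return moi * n_cells / titer_pfu_per_ml * 1000.0


N_CELLS = 1e5           # hypothetical number of cells per sample
STOCK_TITER = 1e10      # hypothetical stock titer in pfu/ml

for moi in (1, 10, 100, 1000):
    volume = stock_volume_ul(moi, N_CELLS, STOCK_TITER)
    print(f"MOI {moi}: {volume:.2f} µl of stock")
```

At low MOIs the computed volume becomes too small to pipette reliably, so in practice the stock would be prediluted; the same function applies to the diluted titer.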

DISCUSSION

Since the different wild-type adenoviruses do not express a uniform marker gene and do
not integrate, and since detection of tagged viral DNA cannot be done on live cells, it is
not possible, at this point, to characterize infected cells for clonogenic or repopulation
capacities. Therefore, adenovirus serotypes for retargeting studies are selected based on
their ability to infect in vitro purified CD34+ cells at low MOIs. This subset of bone
marrow cells is known to contain long-term reconstituting cells. Infection studies with
different adenovirus serotypes can be repeated on purified CD34+ cells (cultured in
IMDM + 10% FCS, β-mercaptoethanol, and 10 units/ml IL-3) as described above.

Purification of CD34+ cells can be performed by direct immunoadherence on anti-CD34
monoclonal antibody-coated plates or on MiniMacs columns as described by
Papayannopoulou (Papayannopoulou, T. et al., 1996, Experimental Hematology, 24:660-
69; Papayannopoulou, T. et al., 1993, Blood, 81:229). The purity of isolated CD34+
cells ranges routinely from 80-95%. Analogous infection studies can be repeated with
selected adenovirus types on CD34+/CD38- subsets.


To confirm productive infection, purified CD34+ cells can be infected with selected
(methylase-tagged) serotypes and analyzed for viral DNA replication. Cultures of purified
human bone marrow CD34+ cells can be used for the transduction and integration studies
as a model for HSCs.


It was recently demonstrated that HSC activity does exist in CD34-negative human bone
marrow subsets (Bhatia, M. et al., 1998, Nature Medicine, 4:1038-45; Osawa, M., et al.,
1996, Science, 273:242-5; Goodell, M. et al., 1997, Nature Medicine, 3:1337-45; Zanjani,
E. D. et al., 1998, Exp. Hematology, 26:353-60). Lin-CD34-CD38- cells can be tested in the
retargeting and transduction studies in combination with repopulation assays in SCID-NOD
mice.

G. Cloning and insertion of the fiber gene. 

METHODS

PCR-cloning of the corresponding fiber gene and insertion into Ad5 based shuttle
plasmids instead of the endogenous Ad5 fiber:

One or several adenoviruses with tropism to CD34+ or another HSC-containing population
are selected for further studies described herein. The complete coding region for fiber
varies between 1-2 kb, depending on the virus type. The fiber encoding sequences can be
obtained by PCR with Pfu polymerase from viral DNA isolated from purified particles of
the selected virus types. The corresponding primers can be designed based on the fiber
sequences available from the EMBL gene bank. The PCR products are cloned as a
PacI-BalI fragment into pCD4 (Figure 10), a shuttle vector for recombination in RecA+ E.
coli. In pCD4, the heterologous fiber gene is flanked on both sides with Ad5 sequences,
which are homologous to regions directly adjacent to the fiber reading frame in Ad5. As
an Ad5 (shuttle vector) derived template for recombination, pCD1, a pBHG10
(Microbix, Toronto, Canada) derivative, can be used. The recombination procedure is
performed according to a protocol routinely used for recombinant adenovirus generation
(Chartier, C., et al., 1996, J. of Virology, 70, 4805-4810). Routinely, 90% of the resulting
plasmids are accurately recombined. The junctions between the heterologous fiber (X)
and Ad5 sequences can be sequenced to confirm the accuracy of recombination. The
resulting plasmid is named pAd5fiberX (pAd5fx). The resulting product is used to
generate pAd5fx-based Ad.AAV containing the heterologous fiber gene.

Construction of chimeric Ad vectors: 

For transduction studies, two Ad vectors were constructed: Ad5GFP and Ad5GFP/F35,
the latter containing a chimeric Ad5/35 fiber gene. Both adenoviral vectors contained a
2.3 kb, CMV promoter driven EGFP gene [derived from pEGFP-1 (Clontech, Palo Alto, CA)]
inserted into the E3 region of Ad5. The EGFP expression cassette was cloned between
Ad5 sequences 25,191-28,191 and 30,818-32,507 into a shuttle plasmid, which contained
the E3 deletion described for pBHG10 (Microbix, Toronto, Canada). The resulting
plasmid was named pAdGFP. For the chimeric vector, the Ad5 fiber gene in pAdGFP
was substituted by an Ad5/35 chimeric fiber gene generated by the two-step PCR
protocol outlined above. In the first PCR step, three DNA fragments corresponding to i)
the Ad5 fiber 5'-nontranslated region and the first 132 bp of the fiber tail domain (nt
30,818-31,174), ii) the Ad35 shaft and knob domains (nt 132-991), and iii) the Ad5 E4
region including the Ad5 fiber polyadenylation signal (nt 32,775-33,651) were amplified
by Pfu-Turbo DNA polymerase. The following primers were used: for the Ad5 tail,
Ad5F-2 (nt 30,798-30,825) 5'-CGC GAT ATC GAT TGG ATC CAT TAA CTA-3' and
Ad5R-2 (nt 31,174-31,153) 5'-CAG GGG GAC TCT CTT GAA ACC CAT T-3'; for the
Ad35 shaft and knob, primers Ad5/35F and Ad5/35R (see above); for the Ad5 E4 and
polyA, primers Ad5F-1 and Ad5R-1 (see above). After 10 PCR cycles, the products were
purified by agarose gel electrophoresis, combined, and then subjected to a second PCR
with primers Ad5F-2 and Ad5R-1. The resulting 2115 bp-long chimeric fiber gene
contained the Ad5 tail and the Ad35 shaft and knob domains. This product was used as a
substitute for the SalI/XbaI Ad5 fiber gene containing fragment in pAdGFP. The
resulting plasmid was named pAdGFP/F35. To generate full-length E1/E3 vector
genomes, pAdGFP and pAdGFP/F35 were inserted in pAdHM4 (Mizuguchi, H., Kay,
M.A. 1998. Human Gene Therapy. 9:2577-2583) by recombination in E. coli (Chartier,
C., et al. 1996. Journal of Virology. 70:4805-4810). To do this, the RecA+ E. coli
strain BJ5183 was co-transformed with pAdHM4 linearized by SrfI mixed with the XbaI
fragments containing the GFP genes, the Ad5 or Ad5/35 fiber genes, and the Ad5
homology regions. The resulting recombinants were analyzed by restriction analysis.
Correct recombinants were amplified in E. coli HB101 and purified by double CsCl
gradient banding. The plasmids were named pAd5GFP and pAd5GFP/F35. The correct
structure of the Ad5/35 chimeric fiber gene was confirmed by endonuclease digestion and
sequencing part of pAd5GFP/F35. To produce the corresponding viruses, pAd5GFP and
pAd5GFP/F35 were digested with PacI to release the viral genomes and transfected onto
293 cells as described (Lieber, A., C.-Y. et al. 1996. Journal of Virology. 70:8944-8960).
Plaques developed 7 to 10 days post-transfection in overlaid cultures. Recombinant
viruses were propagated in 293 cells and purified by standard methods described
elsewhere (Lieber, A., C.-Y. et al. 1996. Journal of Virology. 70:8944-8960).


Hemagglutination assay: 

Twenty-five microliters of serial dilutions of Ad5, Ad35, or chimeric Ad5GFP/F35
virions in McIlvaine-NaCl buffer (0.1 M citric acid, 0.2 M Na2HPO4 [pH 7.2], diluted
1:50 with 0.87% NaCl) were loaded onto 96 well plates. To each dilution, 25 µl of a 1%
suspension of monkey erythrocytes (in McIlvaine-NaCl buffer) was added. The
sedimentation pattern was determined after incubation for 1 hour at 37°C. All tests were
performed in quadruplicate in at least two independent experiments.
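
A hemagglutination titer is commonly reported as the highest dilution that still agglutinates the erythrocytes. The sketch below illustrates that read-out for a two-fold dilution series; the two-fold step, the starting dilution, and the example well pattern are assumptions made for illustration, since the text only specifies serial dilutions.

```python
# Sketch: endpoint titer from a serial dilution series.


def twofold_dilutions(n_steps: int, start: int = 2) -> list:
    """Reciprocal dilutions 1:start, 1:(2*start), ... for n_steps wells."""
    return [start * 2 ** i for i in range(n_steps)]


def ha_titer(dilutions: list, agglutinated: list):
    """Highest reciprocal dilution with agglutination, or None if no well is positive."""
    positive = [d for d, hit in zip(dilutions, agglutinated) if hit]
    return max(positive) if positive else None


dilutions = twofold_dilutions(10)                 # 1:2 ... 1:1024
example_positive = [True] * 9 + [False]           # hypothetical well pattern
example_negative = [False] * 10                   # hypothetical non-agglutinating virus

print("positive series titer: 1:%s" % ha_titer(dilutions, example_positive))
print("negative series titer:", ha_titer(dilutions, example_negative))
```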

Southern blot:

Extraction of genomic DNA, labeling of DNA fragments and hybridization were
performed as described earlier (Lieber, A., C.-Y. et al. 1996. Journal of Virology.
70:8944-8960).

RESULTS 

Construction/Characterization of chimeric fiber: 

Previously, it has been shown that exchanging the fiber knob was sufficient to alter the
tropism of chimeric Ad vectors (Chillon, M., et al. 1999. J. Virology. 73:2537-2540;
Krasnykh, V., et al. 1998. J. Virology. 72:1844-1852; Stevenson, S. C., et al. 1997.
J. Virology. 71:4782-4790). As outlined above, the length of the fiber shaft may critically
determine the entry strategy of a particular serotype. Therefore, we decided to replace not
only the Ad5 fiber knob but also the shaft. The chimeric Ad5/35 fiber contained the Ad5
tail (amino acids 1-44) necessary for interaction with the Ad5 penton base linked to 279
amino acids from Ad35 including the shaft with 7 β-sheets and the knob (Fig. 16A). The
endogenous Ad5 fiber polyA signal was used to terminate transcription of the chimeric
fiber RNA. The combination of the Ad5 capsid including the RGD motif containing
penton base with a short-shafted fiber could be risky because the natural distance
between the fiber knob and the RGD motifs was disturbed. The Ad5 fiber was
substituted by the chimeric fiber sequences based on an E1/E3 deleted Ad vector. This
vector carried a CMV promoter-GFP reporter gene cassette inserted into the E3 region.
The corresponding chimeric virus (Ad5GFP/F35) was produced in 293 cells at a titer of
>2 x 10^12 genomes per ml. For comparison, an E1/E3 deleted Ad vector containing the
original Ad5 fiber gene and the GFP expression cassette was generated (Ad5GFP). The
titer and the ratio of physical to infectious particles were similar between Ad5GFP and
Ad5GFP/F35, indicating that the fiber modification did not significantly alter the stability
and/or growth properties of the chimeric vector. The correctness of the fiber modification
was confirmed by restriction analysis of the Ad5GFP/F35 viral genome followed by
Southern blot hybridization (Fig. 16B), direct sequencing of the fiber-coding region, and
a functional test for hemagglutination (HA) of monkey erythrocytes. The agglutination of
erythrocytes is fiber knob-mediated; it is known that Ad5 does not agglutinate monkey
erythrocytes whereas Ad35 efficiently does (Pring-Akerblom, P., et al. 1998. J. Virology.
72:2297-2304). In HA tests, Ad5GFP/F35 agglutinated monkey erythrocytes with the
same efficiency as Ad35 at dilutions of up to 1:512. In contrast, no hemagglutination was
observed with equivalent Ad5 dilutions. This clearly confirmed the functional activity of
the chimeric Ad5/35 fiber incorporated into the Ad5 capsid.

Generation of chimeric adenoviral vectors (Ad.AAV*) with heterologous fiber
molecules: Adenoviruses with chimeric Ad5-Ad3 fiber are viable and can be produced at
high titers (Krasnykh, V., et al., 1996, J. of Virology, 70, 6839-6846; Stevenson, S. C. et
al., 1997, J. Virology, 71:4782-90). In order to test whether the fiber substitution
described herein affects production or stability of adenoviruses, two E1-deleted
first-generation adenoviral vectors are produced with the AAV-βgal cassette in 293 cells
using standard protocols. One vector is generated by recombination of pAd.AAV-BG
(Fig. 17) with pCD1 (containing the endogenous Ad5 fiber); the other vector (with
heterologous fiber) is the recombination product of pAd.AAV-βGal and pAd5fiberX
(pAd5fx). Virus from single plaques is amplified on 293 cells. The production yield per
293 cell can be determined by plaque-titering of 293 cell lysates. It is anticipated that
the fiber modification will not critically affect the stability of chimeric vectors. Finally,
bone marrow cells can be infected with the retargeted vectors. Two days after infection,
live-cell cytometry is performed to detect β-gal expression using fluorescein
di-β-D-galactopyranoside (FDG) as substrate (Cantwell, M.J. et al., 1996 Blood 88,
4676-4683; Neering, S. et al., 1996, Blood, 88:1147-55; Fiering, S. N. et al., 1991,
Cytometry, 12:291; Mohler, W. et al., 1996, PNAS, 93:57) and the infected cells are
characterized for morphology and surface markers. Before and during infection, bone
marrow cells can be cultured in IMDM/FCS supplemented with thrombopoietin (Tpo),
which supports the survival of HSC (Matsunaga, T. et al., 1998, Blood, 92:452-61;
Papayannopoulou, T. et al., 1996, Experimental Hematology, 24:660-69). Alternatively,
retargeted vectors can be generated with the AAV-GFP (green fluorescence protein)
cassette, and FACS analysis can be performed on transduced cells based on GFP and
surface marker expression.

H. Competition studies of chimeric fiber protein Ad5/35.


Competition studies: 

Cross-competition studies between Ad5, Ad35, and Ad5GFP/F35 (Fig. 18) for binding and
internalization were performed in order to investigate in more detail the pathways which
are used by the chimeric vector to infect target cells. Wild-type Ad35 and the chimeric
vector Ad5GFP/F35 could recognize the same primary receptor, as they competed with
each other for the attachment to K562 cells (Fig. 19A, upper panel). This primary
receptor is different from that used by Ad5, since neither Ad5 viral particles nor anti-CAR
monoclonal antibodies (Fig. 19B, upper panel) were able to abrogate Ad35 or
Ad5GFP/F35 binding. In competition studies for internalization, Ad35 and Ad5GFP/F35
competed with each other with equal efficiency. Ad5 and anti-αv-integrin monoclonal
antibodies (L230) (Figs. 19C, D; lower panel) did not inhibit internalization of Ad35 or
the chimeric virus. To consolidate these data, K562 cells were infected with Ad5GFP and
Ad5GFP/F35 after prior incubation of cells with anti-CAR or anti-αv-integrin
monoclonal antibodies followed by analysis of GFP-expressing cells. The transduction
data mirror the results obtained in adsorption/internalization studies. In summary, this
demonstrated that Ad35 and Ad5GFP/F35 use a CAR- and αv-integrin-independent
pathway for infection of K562 cells; the structural elements which account for these
specific properties are located within the Ad35 fiber and can be transplanted into Ad5 by
fiber substitution.

Ad3 can efficiently interact with K562 cells (Fig. 12). Although Ad3 and Ad35 belong to
the same subgroup (B), the homology between the amino acid sequences of their fibers is
only about 60%. Therefore, we decided to test whether Ad3 could compete with Ad35
and Ad5GFP/F35 for attachment and internalization (Fig. 20). These studies
demonstrated that Ad35 binding was not inhibited by Ad3, indicating the use of different
receptors. Interestingly, Ad3 slightly inhibited attachment of Ad5GFP/F35 (Fig. 20A, left
panel). In addition to binding to the receptor common to the Ad35 and Ad5GFP/F35
fiber, the chimeric capsid (e.g. the Ad5 penton RGD motifs) may also interact with a
second cellular receptor that overlaps with elements involved in Ad3 binding. In
cross-competition for internalization, pre-incubation of cells at 37°C with Ad35 and with
chimeric virus significantly decreased internalization of [3H]-labeled Ad3 (Fig. 20D, right
panel). In the reverse experiment, Ad3 as competitor decreased the level of
internalization by 30% for both Ad35 and the chimeric virus (Fig. 20B, right panel). As
expected, Ad5 and Ad3 did not compete for adsorption or internalization. As shown
before (Fig. 19B), anti-CAR and anti-αv-integrin antibodies did not block Ad3 interaction
with K562 cells. In summary, we concluded that Ad35 and Ad5GFP/F35 bind to
receptor/s different from that of Ad3, although they can use common structural elements
for internalization, which are different from αv-integrins.

Infection studies with chimeric virus:

It was established above that Ad5GFP/F35 infected K562 cells by a CAR- and
αv-integrin-independent pathway. It is possible that this property allows for efficient
transduction of non-cycling CD34+ cells, which scarcely express CAR and αv-integrins. To
test this, the transduction properties of Ad5GFP and Ad5GFP/F35 vectors were analyzed
on CD34+ cells, K562, and HeLa cells. Fig. 21 shows the percentage of transduced,
GFP-expressing cells depending on the MOI used for infection. Nearly 100% of HeLa cells
were transduced with Ad5GFP and Ad5GFP/F35 at MOIs of ≥25. More than 95% of the K562
cells were transduced with Ad5GFP/F35 at MOIs of ≥100, whereas the transduction rate
was significantly lower with Ad5, where it increased with the MOI, reaching a plateau at
~70% GFP-positive cells after infection with an MOI of 400. Transduction of CD34+
cells was about three-fold more efficient with Ad5GFP/F35 than with Ad5GFP at all
MOIs analyzed. Interestingly, at higher MOIs, the transduction rate did not rise
proportionally with the viral dose and soon reached a plateau, indicating that in both cases
only specific subset/s of CD34+ cells were permissive to infection. In order to
characterize these specific, permissive subset/s in more detail, additional transduction
studies were performed. First, the percentage of GFP-expressing cells was determined in
CD34+ fractions that were stained for αv-integrins or CAR (Fig. 22). The low number of
CAR-positive CD34+ cells complicated accurate co-labeling studies, and there was no
correlation between CAR expression and the proportion of transduced cells among
CD34+ cells infected with Ad5GFP or Ad5GFP/F35. Interestingly, for Ad5GFP, 65% of
all GFP-expressing cells were positive for αv-integrins, whereas less than 22% of
GFP-positive cells infected with the chimeric virus stained positive for αv-integrin
expression. While only 17% of the whole CD34+ population expressed GFP after Ad5GFP
infection, the percentage of GFP-expressing cells in the CD34+/αv-integrin-positive
fraction was 50%. This indicates that Ad5GFP vector-mediated GFP expression was
preferentially localized to αv-integrin-positive CD34+ subsets, whereas after infection with
the Ad5GFP/F35 vector, GFP was expressed in a broader spectrum of CD34+ cells, with
most of them being αv-integrin-negative.
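
The three percentages quoted above for Ad5GFP can be cross-checked with simple arithmetic. The following is a minimal sketch, offered as an illustration and not as part of the reported experiments: it applies Bayes' rule to the quoted figures to estimate what fraction of CD34+ cells would have to be αv-integrin positive for the numbers to be mutually consistent. The function and variable names are hypothetical, and the derived fraction is an inference from the quoted values, not a measured result.

```python
# Figures quoted in the text for CD34+ cells infected with Ad5GFP.
P_GFP = 0.17            # fraction of all CD34+ cells expressing GFP
P_AV_GIVEN_GFP = 0.65   # fraction of GFP+ cells that are alpha-v integrin positive
P_GFP_GIVEN_AV = 0.50   # fraction of alpha-v integrin positive cells expressing GFP


def implied_marker_fraction(p_gfp, p_marker_given_gfp, p_gfp_given_marker):
    """Bayes' rule: P(marker+) = P(marker+ | GFP+) * P(GFP+) / P(GFP+ | marker+)."""
    return p_marker_given_gfp * p_gfp / p_gfp_given_marker


# Fraction of CD34+ cells that would be alpha-v integrin positive (derived, not measured).
p_av = implied_marker_fraction(P_GFP, P_AV_GIVEN_GFP, P_GFP_GIVEN_AV)

# Enrichment of GFP expression in the alpha-v positive fraction over the whole population.
enrichment = P_GFP_GIVEN_AV / P_GFP

print(f"implied alpha-v integrin positive fraction: {p_av:.1%}")
print(f"enrichment of GFP+ cells in that fraction: {enrichment:.1f}-fold")
```

Under these assumptions, roughly one fifth of the CD34+ population would carry αv-integrins, and GFP expression is about three-fold enriched in that fraction, which is in line with the preferential transduction described in the text.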

Next, transduced cells were simultaneously analyzed for GFP as well as for CD34 and
CD117 markers. As mentioned before, only about 90% of all cells used in our analysis
were positive for CD34 at the time of infection, hence the multiparameter analysis for
CD34 and GFP. A population of CD34+ cells is extraordinarily heterogeneous in
morphology and stem cell capacity. The subpopulation of CD34+ and CD117+ cells
resembles very primitive hematopoietic cells (Ikuta, K., Weissman, I.L. 1992 Proc. Natl.
Acad. Sci. USA 89:1502-1506; Simmons, P. J., et al. 1994. Expl. Hematology. 22:157-165).
Fig. 23 summarizes the analyses of GFP expression in correlation with these
specific stem cell markers. While 54% of cells infected with the chimeric vector were
positive for both GFP and CD34, only 25% of cells infected with Ad5GFP expressed the
transgene and the CD34 marker (Fig. 23A, lower panel). More importantly, based on GFP
expression, the chimeric virus transduced 80% of c-kit positive cells, whereas the
Ad5-based vector transduced only 36% (Fig. 23A, middle panel). In an additional experiment,
CD34+ cells were sorted for CD117 expression prior to infection with Ad5GFP or
Ad5GFP/F35 and, 24 hours post-infection, GFP expression was analyzed in this specific
fraction (Fig. 23B). This analysis revealed that the chimeric vector transduced 4-fold
more CD34+/CD117+ cells than the Ad5GFP vector.

In conclusion, these results demonstrated that the chimeric Ad5GFP/F35 vector was
clearly superior to the Ad5GFP vector in targeting and transduction of CD34+ cells.
Furthermore, the data suggest that the spectrum of CD34+ cell subsets permissive for Ad
infection was significantly different for the chimeric vector than for the Ad5 vector.

Analysis of viral genomes within CD34+ cells infected with the Ad5 and chimeric
vectors: 

So far, the transduction rate of CD34+ cells was measured based on GFP expression after
infection with Ad5GFP and Ad5GFP/F35. Considering the extraordinary heterogeneity of
CD34+ cells in morphological and functional parameters, GFP may not be expressed in
all cell types that were efficiently infected. Possible reasons are that the CMV
promoter may not be active in all cell types or that the regulation of transgene expression
could differ between subsets at a post-transcriptional or post-translational level. To test
this, we quantified the number of intracellular (transduced) viral genomes within
GFP-positive and GFP-negative fractions of CD34+ cells infected with Ad5GFP and
Ad5GFP/F35. To do this, twenty-four hours after infection, CD34+ cells were sorted into
GFP-positive and GFP-negative fractions, which were subsequently used to isolate
genomic DNA together with transduced viral DNA. The number of viral genomes was
determined by quantitative Southern blot as described for Fig. 15. Per GFP-positive
CD34+ cell, about 270 copies of the Ad5GFP/F35 viral genome were detected.
Interestingly, a remarkable 200 copies of the Ad5GFP/F35 viral genome were found per
GFP-negative CD34+ cell (Fig. 24A and 25). This demonstrated that not all infected cells
expressed GFP and implies that the actual transduction rate was higher than 54%
(GFP-positive cells). We concluded that the CMV promoter was not active in all transduced
CD34+ subsets. No Ad5GFP vector-specific signal was detected within infected CD34+
(GFP-positive or -negative) fractions by Southern blot, which had a detection limit of 14
viral genomes per cell. From this, we can conclude that the vector DNA concentration per
transduced cell was at least 20 times higher for Ad5GFP/F35 than for Ad5GFP.
Ad5GFP DNA was only detectable in DNA samples from infected CD34+ cells by
Southern blot after prior PCR amplification with vector-specific primers (Fig. 24B and
25). This indicates that the replication-deficient Ad5 vector is present but at a very low
copy number, which may be limited by intracellular genome stability. Using the
PCR-Southern detection method, Ad5 vector DNA was also detected in GFP-negative cells,
supporting the notion that the CMV promoter may not have been the optimal choice for
transduction studies. It is notable that studies by others on viral genome analyses after
infection of CD34+ cells with Ad5 vectors were performed only after prior PCR amplification
(Mitani, K., et al. 1994. Human Gene Therapy. 5:941-948; Neering, S. J., et al. 1996.
Blood. 88:1147-1155).
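
The "at least 20 times" estimate follows from comparing the measured Ad5GFP/F35 copy numbers with the Southern blot detection limit, since the Ad5GFP signal fell below that limit. A minimal sketch of this arithmetic is given below. It assumes that the garbled detection limit in the text reads 14 viral genomes per cell, and the function name is hypothetical.

```python
# Values quoted in the text (copies of vector genome per cell).
DETECTION_LIMIT = 14     # Southern blot detection limit (assumed reading of the text)
COPIES_GFP_POS = 270     # Ad5GFP/F35 genomes per GFP-positive CD34+ cell
COPIES_GFP_NEG = 200     # Ad5GFP/F35 genomes per GFP-negative CD34+ cell


def min_fold_excess(detected_copies, detection_limit):
    """Lower bound on the fold difference versus a vector that stayed below the limit."""
    if detection_limit <= 0:
        raise ValueError("detection limit must be positive")
    return detected_copies / detection_limit


fold_pos = min_fold_excess(COPIES_GFP_POS, DETECTION_LIMIT)
fold_neg = min_fold_excess(COPIES_GFP_NEG, DETECTION_LIMIT)

print(f"GFP-positive fraction: at least {fold_pos:.1f}-fold more vector DNA")
print(f"GFP-negative fraction: at least {fold_neg:.1f}-fold more vector DNA")
```

With these inputs the bound for the GFP-positive fraction comes out slightly above 19, which the text rounds to about 20; the bound is a minimum because the true Ad5GFP copy number may lie well below the detection limit.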

DISCUSSION 

The chimeric Ad5GFP/F35 vector has binding and internalization properties similar to
Ad35. Therefore, the fiber substitution was sufficient to swap cell tropism from Ad5 to
Ad35. The Ad5GFP/F35 capsid chimera contained the short-shafted Ad35 fiber
incorporated into an Ad5 capsid, instead of the naturally occurring long-shafted Ad5
fiber. During Ad5 infection, interaction between the penton base and integrins is
required to induce viral internalization. For this interaction, the length of the fiber shaft and
the precise spatial arrangement of knob and RGD motifs are critical for the virus entry
strategy. The natural spatial arrangement is disturbed when short-shafted heterologous
fibers are inserted into the Ad5 capsid. Interestingly, the Ad5/35 capsid chimera allows
for efficient infection, suggesting that the protruding RGD motifs in the Ad5 penton
base do not affect the interaction with the primary Ad35 receptor. So far, most of the
chimeric viruses were generated by substituting only the Ad5 knob while maintaining the
long Ad5 fiber shaft (Chillon, M., et al. 1999. J. Virology. 73:2537-2540; Krasnykh, V.
N., et al. 1996. J. Virology. 70:6839-6846; Stevenson, S. C., et al. 1995. J. Virology.
69:2850-2857; Stevenson, S. C., et al. 1997. J. Virology. 71:4782-4790). The exception
was an Ad5/7 chimeric virus (Gall, J., et al. 1996. J. Virology. 70:2116-2123), where the
whole Ad5 fiber was substituted by the short-shafted Ad7 fiber. However, similar to the
parental Ad5, the Ad5/7 chimera still required αv-integrins for infection.

This Ad5GFP/F35 chimera is the first demonstration that, despite the presence of RGD
motifs within the Ad5 penton, the chimeric virus uses cell entry pathways determined
primarily by the receptor specificity of the short-shafted heterologous fiber. This does not
exclude that interaction with a secondary receptor may increase binding affinity. The
latter is supported by the observation that Ad35 and Ad5GFP/F35 slightly differed in
their ability to compete with Ad5 or Ad3 for binding. It is possible that Ad5/35
attachment involves, in addition to the high-affinity fiber binding, interaction between
Ad5 capsid proteins (e.g. RGD motifs) and secondary receptor/s that overlap with those
used by Ad3 and Ad5.

These data indicate that infection with Ad5-based vectors is restricted to a specific subset
of CD34+ cells. The percentage of GFP-expressing cells after Ad5GFP infection of
CD34+ cells reached a plateau at MOIs higher than 100, indicating that only a limited
fraction of CD34+ cells was permissive to Ad5. Also, strong replication of wild-type Ad5
in infected CD34+ cells may be the result of preferential transduction of a specific
subpopulation of CD34+ cells, resulting in expression of early viral genes at a level
sufficient to initiate viral replication. The presence of a specific subpopulation of CD34+
cells permissive to Ad5-vector infection was suggested by others (Byk, T., et al. 1998. Human
Gene Therapy. 9:2493-2502; Neering, S. J., et al. 1996. Blood. 88:1147-1155). In the
present report, we further characterized this subpopulation and demonstrated that
Ad5-based vectors preferentially infected αv-integrin-positive CD34+ cells. Integrins
(including αv) are thought to be important for homing and trafficking of transplanted
hematopoietic cells; however, little is known about the correlation between αv-integrin
expression and the differentiation status of hematopoietic cells (Papayannopoulou, T.,
Craddock, C. 1997. Acta Haematol. 97:97-104; Roy, V., Verfaillie, C. M. 1999. Exp.
Hematol. 27:302-312). There was no clear correlation between CAR and GFP expression,
suggesting that Ad5GFP may be able to use another membrane protein as a primary
receptor. Alternatively, Ad5GFP transduction observed at an MOI of 200-400 could be
the result of direct interaction between virus and αv-integrins triggering internalization,
which may be the preferred pathway in the absence of CAR (Legrand, V., et al. 1999. J.
Virology. 73:907-919). Importantly, infection with the chimeric Ad5GFP/F35 vector was
not restricted to the αv-positive CD34+ subpopulation.

Among CD34+ cells, the subpopulation of CD34+ and CD117+ cells resembles very
primitive hematopoietic cells (Ikuta, K., Weissman, I.L. 1992 Proc. Natl. Acad. Sci. USA.
89:1502-1506; Simmons, P. J., et al. 1994. Expl. Hematology. 22:157-165). The receptor
for stem cell factor, CD117 (c-kit), belongs to a tyrosine kinase family. It was previously
shown that c-kit+, CD34+ cord blood cells contain a high fraction (16%) of
hematopoietic progenitors (Neu, S., et al. 1996. Leukemia Research. 20:960-971). Early
in ontogeny, CD34+/CD117+ cells have long-term repopulating activity (Sanchez, M. J., et
al. 1996. Immunity. 5:513-525). An average of 50-60% of CD34+ cells are reported to be
CD117 positive (Ikuta, K., Weissman, I.L. 1992 Proc. Natl. Acad. Sci. USA. 89:1502-1506;
Neu, S., et al. 1996. Leukemia Research. 20:960-971; Simmons, P. J., et al. 1994.
Expl. Hematology. 22:157-165). In our studies, the chimeric vector expressed GFP in
54% of CD34+ cells and 80% of CD34+/c-kit+ cells. The actual viral transduction rate
could be even higher because transduced Ad5GFP/F35 vector DNA was also found in
GFP-negative fractions of infected cells. This indicates that the CMV promoter used to
drive GFP expression in our vectors was not active in all transduced cells. We selected
the CMV promoter for transgene expression based on published data demonstrating that
PGK and CMV promoters allowed for efficient transgene expression in CD34+ cells,
whereas the HTLV-I and RSV promoters were almost inactive (Byk, T., et al. 1998.
Human Gene Therapy. 9:2493-2502; Case, S. S., et al. 1999. Proc. Natl. Acad. Sci. USA.
96:2988-2993). On the other hand, studies by Watanabe et al. (Watanabe, T., et al. 1996.
Blood. 87:5032-5039) suggest that the CMV promoter is not active or is rapidly silenced in
certain CD34+ subsets. Our data underscore this observation. Considering retroviral
transduction studies, the retroviral MLV promoter may have been a better candidate for
transduction studies in hematopoietic cells (Bregni, M., et al. 1998. Gene Therapy. 5:465-472).

After having demonstrated that the Ad5GFP/F35 vector efficiently transduced cells
carrying stem cell specific markers, the next logical step would be to perform colony
assays with pre-sorted GFP-positive/negative cells. However, this assay is complicated by
the fact that infection with first generation Ad vectors is cytotoxic and affects the
formation and growth of progenitor colonies in MC cultures (Mitani, K., et al. 1994.
Human Gene Therapy. 5:941-948; Watanabe, T., et al. 1996. Blood. 87:5032-5039). This
side effect is caused by the expression of Ad proteins within transduced cells (Lieber, A.,
C.-Y. et al. 1996. Journal of Virology. 70:8944-8960; Schiedner, G., et al. 1998. Nature
Genetics. 18:180-183; Yang, Y., et al. 1994. Proc. Natl. Acad. Sci. USA. 91:4407-4411).
Some of these proteins (e.g. E4-orf4, pTP, or E3-11.6k) have pro-apoptotic activity
(Langer, S. J., Schaak, J. 1996. Virology. 221:172-179; Lieber, A., et al. 1998. J.
Virology. 72:9267-9277; Shtrichman, R., Kleinberger, T. 1998. J. Virology. 72:2975-2983;
Tollefson, A. E., et al. 1996. J. Virology. 70:2296-2306). Clearly, this would
affect the outcome of transduction studies with Ad5GFP/F35, which allows for the
efficient transfer of viral genomes into CD34+ cells, implying significant expression of
viral proteins. Moreover, recently published data indicate that short-term colony assays
mostly measure mature progenitors and do not represent a rigorous test for transduction
of potential stem cells.

A definitive demonstration that Ad5GFP/F35-based vectors can transduce HSC requires
colony assays or, preferably, repopulation assays in SCID-NOD mice. We can perform
these studies with gutless vectors (Steinwaerder, D. S., et al. 1999. J Virol 73:9303-13)
and integrating ΔAd.AAV vectors devoid of all viral genes (Lieber, A., et al. 1999. J
Virol 73:9314-24) generated based on Ad5GFP/F35 chimeric capsids. Alternatively,
gutless, retargeted vectors could be used to transiently express a retroviral receptor on
CD34+ cells to increase their susceptibility to infection with retroviral vectors, based on
an approach that we have published earlier (Lieber, A., et al. 1995. Human Gene
Therapy. 6:5-11).

Our finding that Ad5GFP/F35 can efficiently transduce hematopoietic cells with potential
stem cell capacity represents an important step towards stable gene transfer into HSCs
and gene therapy of blood disorders. Furthermore, the virological aspects of this
invention contribute to a better understanding of adenovirus-cell interactions.

I. Retargeting of Ad5 based vectors with modified fibers carrying specific ligand
peptides for HSC and other cell types

Another alternative to make Ad5-capsid-based vectors suitable for HSC gene therapy is
to incorporate the coding sequence for HSC-specific peptides into the HI loop region of
the Ad5 fiber gene. The modification of the HI loop was successfully exercised by
Krasnykh et al. with a 7 amino-acid long FLAG peptide (DYDDDDK). Using phage
display peptide libraries (Pasqualini, R. et al., 1996, Nature, 380:364-66), Renata
Pasqualini (La Jolla Cancer Research Center) recently reported, at the First Meeting of
the American Society for Gene Therapy, the identification of small peptide ligands
specific for bone marrow cells. The corresponding sequences encoding these peptides
can be added to modify the HI loop sequence employing site-directed mutagenesis.
Optimally, the ligands should allow for the efficient internalization of adenoviral particles
based on a CAR- and integrin-independent pathway. Modified adenoviral vectors
containing the AAVBG cassette can be produced and tested for HSC tropism as
described above.

Adenovirus peptide display:



In order to retarget adenoviruses to any cell type of interest, a strategy is provided which 
involves creating a library of adenoviruses displaying random peptides in their fiber 
knobs as ligands and screening this library for adenovirus variants with tropism to a 
particular cell type in vitro and potentially in vivo. 



The development of the adenovirus peptide display technique is based on the following
ideas. (i) Although the tertiary structure of the Ad5 fiber knob is known, it remains
unclear which domains are involved in receptor binding. There are data suggesting that
receptor-binding domains partially overlap with hemagglutination domains, which are
well characterized for a number of serotypes. Therefore, three intramolecular loop
regions representing potential receptor binding sites can be substituted by random peptide
libraries. Eight amino acid residues in the center of the FG, or GH loops can be
substituted by octameric random peptides (Figs. 26 and 27). These substitutions will
replace CAR tropism and allow for infection of refractory cell types. (ii) To synthesize
the oligonucleotides encoding the peptide library, a novel technique to assemble
pre-synthesized trinucleotides representing the codons for all 20 amino acids is employed.
This avoids termination codons and assures optimal codon usage and translation in
human cells. It is possible to synthesize either a completely randomized library, with all
20 amino acids incorporated with the same probability, or a partially randomized library
with only three (on average) random amino acid substitutions per octamer at random
positions, to maintain certain critical features of the tertiary
knob structure while introducing variability. The latter model is based on the distribution
of amino acids present in the hypervariable CDR 1 or 2 region of immunoglobulins. (iii)
To maintain a representative library size of about 10^10 different octamers per modified
loop, a new cloning strategy is employed to allow for insertion of the library into the
wild-type Ad5 genome without introducing additional amino acids at the substitution site
and without transformation into bacteria. This strategy is based on a "seamless" cloning
technique available from Stratagene. (iv) In order to produce the library of viruses, viral
genomic DNA containing the modified fiber sequences is transfected into 293 cells
without reduction of the library size. This critical step is done by conjugating the viral
library DNA to carrier Ad5-based adenovirus via poly-lysine to assure 100% transfection
efficiency. This technique allows for coupling of ~1 µg of plasmid DNA (or ~1x10^10
adenoviral genomes) to 10^10 viral particles, which can be used to infect 293/cre cells at an
MOI of 10-100. Importantly, the carrier adenoviral genome has the packaging signal
flanked by lox sites, preventing the packaging of carrier viral DNA after infection of 293
cells that express Cre recombinase (293/cre). This helper virus system is routinely used to
produce so-called gutless adenoviruses. Therefore, the virus progeny represents library
genomes packaged into capsids containing preferentially Ad5 fibers. This is important
for the next infection step into 293 cells at an MOI of 1 to assure a homogeneous fiber
population on the capsid where the fibers are encoded by the packaged genome.
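
The library size of about 10^10 can be put in context by counting the possible octamers. The sketch below is illustrative only and rests on stated assumptions: the fully randomized case uses all 20 amino acids at each of eight positions, the partially randomized case is simplified to exactly three substituted positions (the text says three on average), and the coverage estimate assumes clones are sampled uniformly and independently. All function names are hypothetical.

```python
import math

AMINO_ACIDS = 20
OCTAMER_LENGTH = 8
LIBRARY_CLONES = 10**10   # approximate library size stated in the text


def full_diversity(length=OCTAMER_LENGTH, alphabet=AMINO_ACIDS):
    """Number of distinct peptides when every position is fully randomized."""
    return alphabet ** length


def partial_diversity(substitutions, length=OCTAMER_LENGTH, alphabet=AMINO_ACIDS):
    """Variants differing from a parent sequence at exactly `substitutions` positions."""
    return math.comb(length, substitutions) * (alphabet - 1) ** substitutions


def expected_coverage(clones, diversity):
    """Expected fraction of variants present at least once (uniform sampling assumed)."""
    return 1.0 - math.exp(-clones / diversity)


full = full_diversity()
partial = partial_diversity(3)

print(f"fully randomized octamers: {full:.2e}")
print(f"variants with exactly 3 substitutions: {partial}")
print(f"coverage of the full space by 1e10 clones: {expected_coverage(LIBRARY_CLONES, full):.0%}")
```

Under these assumptions a library of 10^10 clones samples a substantial part, but not all, of the fully randomized octamer space, whereas the partially randomized space is several orders of magnitude smaller and would be covered many times over.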


J. Production Of Adenovirus Vectors With Increased Tropism To Hepatocytes

An example of a G-H loop substitution to target Ad5 to hepatocytes was successful.
Preliminary tests demonstrated that two evolutionarily conserved regions within the
malaria circumsporozoite surface protein (CS), termed RI and RII+, mediate specific
interaction with hepatocytes but not with other organs (including spleen, lung, heart and
brain), nor with Kupffer cells, liver endothelial cells or with other regions of the
hepatocyte membrane (Cerami, C. et al., 1992, Cell, 70:1021-33; Shakibaei, M. and U.
Frevert, 1996, J. Exp. Med., 184:1699-711). These regions are conserved among
different species including Plasmodium berghei, P. cynomolgi, and P. falciparum, which
infect mouse, monkey and human hepatocytes, respectively (Cerami, C. et al., 1992, Cell,
70:1021-33; Chatterjee, S. et al., 1995, Infect Immun., 63:4375-81). Peptides derived
from RI (KLKQPG) or RII (EWSPCSVTCGNGIQVRIK) blocked CS binding to
hepatocytes and infection by sporozoites in vivo (Cerami, C. et al., 1992, Cell, 70:1021-33;
Chatterjee, S. et al., 1995, Infect Immun., 63:4375-81). RI and RII+ peptides were
separately inserted into the Ad5 fiber knob (H-I and G-H loop) containing mutations that
abolish binding to CAR and alpha-v integrins (Kirby, L. et al., 2000, J. Virol., 74:2804-13;
Wickham, T. J. et al., 1995, Gene Ther., 2:750-6). Based on preliminary data, a
short-shafted fiber was used so that the virus entry strategy predominantly depends on the
interaction with the primary (hepatocyte-specific) receptor. The hepatocyte-specific
ligands are flanked by short glycine stretches to provide flexibility and embedded into a
loop formed by two cysteines. This is one of the classical strategies to incorporate ligands
into a protein scaffold (Doi, N. and H. Yanagawa, 1998, Cell Mol Life Sci., 54:394-404;
Koivunen, E. et al., 1995, Biotechnology (NY), 13:265-70) and to guarantee their
presentation at the protein surface. The biodistribution of the best variants is tested in
vivo in C57Bl/6 mice based on Southern blots or PCR for vector DNA in different
organs. This mouse strain is known to be susceptible to infection with P. berghei
(Chatterjee, S. et al., 1995, Infect Immun., 63:4375-81).

K. Production Of Adenovirus Vectors With Increased Tropism To Tumor Cells

A similar strategy is to insert two peptides obtained after selection for tumor tropism by
displaying random peptides on filamentous phages. The first double cyclic peptide
(RGD-4) proved to bind specifically to integrins present on tumor vasculature (Ellerby, H.
M. et al., 1999, Nat. Med., 5:1032-8). The second peptide targets specific matrix
metalloproteinases associated with metastatic tumor cells, as shown for the breast cancer
cell line MDA-MB-435 (Koivunen, E. et al., 1999, Nat. Biotechnol., 17:768-74).
Tropism-modified vectors are tested in animal models with hepatic metastases derived
from MDA-MB-435 cells (Fig. 28).

L. Development Of A Peptide Display Technique Based On Adenoviruses 

A synthetic peptide library is described that allows adenovirus vectors to express random
peptides in the G-H loop of the fiber knob domain. The technique of a phage display
library is optimized to generate a library of adenoviruses displaying random peptides in
their fiber knob. This library of adenovirus variants is then screened for tropism to a
particular cell type in vitro and potentially in vivo. The oligonucleotides encoding the
peptide library are synthesized using a novel technique to assemble pre-synthesized
trinucleotides representing the codons for all 20 amino acids. This avoids termination
codons and assures optimal codon usage and translation in human cells. To maintain a
representative library size, a new "seamless" cloning strategy is used that allows for
insertion of the library into the wild-type Ad5 genome without introducing additional amino
acids at the substitution site and without transformation into bacteria. Transfection into
293 cells is done by conjugating the viral library DNA to carrier Ad5-based adenovirus via
polylysine to assure 100% transfection efficiency. Importantly, the carrier adenoviral
genome has its packaging signal flanked by lox sites, preventing the packaging of carrier
viral DNA after infection of 293 cells that express Cre recombinase (293/cre). The library
is produced with E1-positive viruses depleted for CAR and integrin tropism. Only variants
that have successfully infected the cell type of interest will replicate, resulting in de novo
produced virus. The sequence of the peptide ligand that conferred the particular tropism
will then be analyzed in de novo produced virus.
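
The trinucleotide assembly idea described above can be sketched as follows. This is an illustrative simulation only, not the synthesis chemistry itself: each position of the octamer is drawn from a pool of 20 pre-defined codons, one per amino acid, so a stop codon can never be incorporated. The particular codon chosen for each amino acid is an assumption made for this example (codons commonly regarded as frequent in human genes), and the function names are hypothetical.

```python
import random

# One codon per amino acid (assumed choice for illustration); no stop codon is in the pool.
CODON_POOL = {
    "A": "GCC", "C": "TGC", "D": "GAC", "E": "GAG", "F": "TTC",
    "G": "GGC", "H": "CAC", "I": "ATC", "K": "AAG", "L": "CTG",
    "M": "ATG", "N": "AAC", "P": "CCC", "Q": "CAG", "R": "CGG",
    "S": "AGC", "T": "ACC", "V": "GTG", "W": "TGG", "Y": "TAC",
}
STOP_CODONS = {"TAA", "TAG", "TGA"}
CODON_TO_AA = {codon: aa for aa, codon in CODON_POOL.items()}


def random_insert(length=8, rng=None):
    """Draw a random peptide and its coding sequence from the trinucleotide pool."""
    rng = rng or random.Random()
    residues = sorted(CODON_POOL)
    peptide = "".join(rng.choice(residues) for _ in range(length))
    dna = "".join(CODON_POOL[aa] for aa in peptide)
    return peptide, dna


def in_frame_codons(dna):
    """Split a coding sequence into consecutive in-frame triplets."""
    return [dna[i:i + 3] for i in range(0, len(dna), 3)]


def translate(dna):
    """Translate a sequence built from the pool back into its peptide."""
    return "".join(CODON_TO_AA[codon] for codon in in_frame_codons(dna))


peptide, dna = random_insert(rng=random.Random(0))
print(peptide, dna)
```

Because every position is filled from the same 20-codon pool with equal probability, each amino acid is equally likely and the 24-nucleotide insert is guaranteed to be free of in-frame termination codons, in contrast to fully degenerate nucleotide mixtures.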



COMBINATION NOVEL ADENOVIRAL VECTOR AND MODIFIED FIBER 
PROTEIN 

This example describes the following studies, which combine the technology of the
integrating adenovirus vector that is devoid of all adenoviral genes with the modified
fiber protein that retargets the vector to quiescent HSC.

A. Transduction studies with retargeted vectors in HSC: 

In order to transduce quiescent HSC and integrate into chromosomal DNA, retargeted
ΔAd.AAV^fx vectors are tested for reporter gene expression and vector integration
while simultaneously analyzing the clonogenic capacity of the transduced cells. The
modified ΔAd.AAV^fx hybrid vectors contain genomes devoid of all adenoviral genes (a
"gutless" adenovirus vector) packaged into Ad5 capsids with modified fibers. Rep may be
incorporated into these ΔAd.AAV^fx vectors to allow for site-specific integration into
AAVS1.

Transduction studies: 

Purified human CD34+ cells in IMDM/FCS+IL-3 and SCF are infected with different
doses of ΔAd.AAV^fx-BG (1-10^7 genomes per cell). CD34+ cells infected with
ΔAd.AAV^fx-βGal are cultured for 2 days in suspension, and β-Gal+ cells are sorted by FACS
using FDG as substrate. This determines the infection efficiency. β-Gal-expressing cells
are then submitted to clonogenic assays in semi-solid cultures (in two dishes per MOI) in
the presence of multiple cytokines (IL-3, SCF, Epo, G-CSF, GM-CSF, IL-7, Tpo). A
first set of semi-solid cultures can be evaluated after 7 days; another set can be analyzed
after 14 days. Colonies that have formed in semi-solid culture can be characterized by
light microscopy and subsequently stained with X-Gal. Most of the vector
genomes should remain episomal and can be lost with successive cell divisions. Thus,
while most cells can be X-Gal positive at day 2 or day 7 after infection, most of the larger
colonies (analyzed at day 14 p.i.) may not stain homogeneously for β-Gal. A
representative number of X-Gal positive and X-Gal negative colonies can be picked and
analyzed for episomal and integrated vector DNA. The outcome depends on the MOI
used for infection and the integration status of the vector. These studies determine
whether hybrid vectors can infect primitive progenitors.

Detailed characterization of hybrid vector integration:

CD34+ cells can be infected with ΔAd.AAV^fx-SNori (MOI 1-10^7) and subjected to G418
selection in methyl cellulose (MC) cultures in the presence of growth factors (IL-3 and
SCF). The resulting colonies are a mixture of mainly myeloid cells. The number and
morphology of G418-resistant colonies can be determined after 2 weeks of selection.
This strategy may be disadvantageous in that the appropriate stem cell may not divide
and form G418-resistant colonies under the specific culture conditions used. Moreover, it
may be difficult to perform G418 selection on a population of heterogeneous cells, which
vary in their sensitivity to G418. Therefore, another set of ΔAd.AAV^fx-SNori infected
CD34+ cells can be cultured in methyl cellulose (+IL-3, SCF) without G418 selection.
After 2-3 weeks, single colonies can be picked from both (w/ and w/o G418) MC
cultures, morphologically characterized, and analyzed for integrated vector using the
modified protocol developed for integration studies in a small number of cells (see Figure
8). This strategy allows the assessment of whether hybrid vectors integrate into the
genome of CD34+ cells cultured in the presence of growth factors. This study provides
information about potential position effects affecting neo or βGal expression from integrated
vector copies and about the structure of the integrated vector and the flanking
chromosomal regions.

An alternative method to confirm vector integration:

Fluorescence in situ hybridization (FISH) analysis can be performed in individual cells
from MC colonies. CD34+ cells are cultured in MC in the presence of growth factors to
induce cell division and subsequently treated with colchicine. Metaphase chromosome
spreads are analyzed with a biotin-ATP labeled probe specific for the βGal or SNori gene
and a digoxigenin-UTP labeled probe for the human X-chromosome as an internal control
(provided by Christine Disteche, University of Washington). Specific hybridization can
be visualized with corresponding anti-biotin or anti-DIG antibodies labeled with different
fluorochromes (e.g. FITC and Texas Red). Hybrid vector DNA may integrate as
concatemers, which would facilitate detection by FISH. This technique allows one to
localize the chromosomal integration sites of hybrid vectors.

Test transduction into quiescent bone marrow subpopulations:

The hybrid vectors described so far can be tested to determine whether quiescent CD34+
cells can be stably transduced. To avoid significant cell proliferation, purified CD34+
cells are cultured in serum free IMDM supplemented with thrombopoietin (Tpo). Tpo alone
can support the survival of stem cells without stimulating their active proliferation
(Matsunaga, T. et al., 1998, Blood, 92:452-61; Papayannopoulou, T. et al., 1996,
Experimental Hematology, 24:660-69). To analyze the proliferation status of CD34+ cells
at the time of infection with the hybrid vector ΔAd.AAVfx-BG, BrdU is added to the
culture medium 2 hours before infection. One set of cells is maintained as a
suspension culture in IMDM containing Tpo only for two days. Another set of cells is
grown in IMDM+Tpo supplemented with multiple cytokines. Forty-eight hours after
infection, CD34+ cells can be FACS sorted for beta Gal expression using FDG. FDG
positive cells can be further analyzed for cellular DNA replication based on BrdU
incorporation and for specific CD34+ subset markers. To do this, cytospins from FDG+
cells can be submitted to immunofluorescence with BrdU specific antibodies and with
antibodies to specific cell surface markers (e.g. CD38, CD41). Alternatively, consecutive
paraffin sections of the same cell can be analyzed for (a) transgene expression by X-Gal
staining, (b) DNA synthesis based on BrdU incorporation, and (c) specific surface
markers. This allows one to confirm that the culture conditions with Tpo alone prevent
significant genomic DNA replication and subsequent cell proliferation, as well as to
determine whether quiescent CD34+ cells can be infected, based on beta Gal expression
in cells where BrdU labeling is absent.
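
The two readouts described above (FDG for beta Gal expression, BrdU for DNA replication) define four possible cell categories. The following is a minimal sketch of that bookkeeping, not part of the described protocol. It assumes each analyzed cell has already been scored as positive or negative for each readout; the `Cell` record, the `classify` and `summarize` functions, and the category labels are hypothetical names introduced only for illustration.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    """Hypothetical per-cell scoring record (names are illustrative)."""
    fdg_positive: bool   # beta Gal expression detected with FDG
    brdu_positive: bool  # BrdU incorporated, i.e. DNA replication occurred


def classify(cell: Cell) -> str:
    """Assign a cell to one of four categories from its two readouts."""
    if cell.fdg_positive and not cell.brdu_positive:
        return "transduced, quiescent"
    if cell.fdg_positive and cell.brdu_positive:
        return "transduced, replicating"
    if cell.brdu_positive:
        return "untransduced, replicating"
    return "untransduced, quiescent"


def summarize(cells):
    """Count cells per category."""
    return Counter(classify(c) for c in cells)


if __name__ == "__main__":
    # Made-up scores, for illustration only.
    example = [Cell(True, False), Cell(True, True), Cell(False, False)]
    for category, count in summarize(example).items():
        print(f"{category}: {count}")
```

In this sketch, cells in the "transduced, quiescent" category correspond to the case of interest in the text: beta Gal expression in cells where BrdU labeling is absent.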

Test hybrid vector integration into quiescent CD34+ cells:

Two sets of CD34+ cells are infected. The first set of ΔAd.AAVfx-SNori infected cells is
cultured for 5-7 days in the presence of cytokines; the other set is cultured without
cytokines. To maintain CD34+ cell viability without cytokines during this period, the
cells are cultured in the presence of Tpo or underlaid with a stromal cell line (AFT024)
(Moore, K. et al., 1997, Blood, 89:4337-47), which can maintain HSC viable for 4 to 7
weeks. After this period, both sets are submitted to clonogenic assays (in
the presence of multiple cytokines) either in combination with G418 selection or without
selection. Single colonies are analyzed morphologically and submitted to genomic DNA
analysis (Figure 8) to determine the vector integration status. The ultimate proof of stem
cell transduction is the in vivo survival/expansion assay. To do this, the CD34+ cells
expressing beta Gal are used for transplantation experiments. If the number of FDG+
cells is not sufficient, total ΔAd.AAVfx-BG infected cells as well as all ΔAd.AAVfx-SNori
infected cells can be used directly without selection. Transplantation can be performed
via tail vein injection into sublethally irradiated SCID NOD mice (Dao, M.A., et al.,
1998, Blood, 4, 1243-1255; Matsunaga, T. et al., 1998, Blood, 92:452-61). At different
time points after transplantation (4 to 8 weeks), mice can be sacrificed to obtain bone
marrow cells, which can then be cultured in suspension until various assays are performed
for X-Gal and cell markers as described earlier. These cells can also be submitted to a
secondary colony assay in MC or secondary transplantation into SCID NOD mice.
Furthermore, MC colonies derived from these cells can be analyzed for the presence of
integrated vector DNA by the method illustrated in Figure 8. The expression and
integration data together allow conclusions about the repopulation efficiency and about
potential position effects.

B. Optimization of ΔAd.AAVfx vectors for γ-globin expression in hematopoietic
stem cells:

One specific example of the invention is (a) to construct a retargeted hybrid vector with
the γ-globin gene as the transgene under the control of an erythroid cell specific
promoter, (b) to analyze the level and kinetics of γ-globin expression after transduction
with hybrid vectors in in vitro and in vivo assays, (c) if required, to protect gene
expression from position effects using γ-globin LCRs or insulators incorporated into
hybrid vectors, and (d) to study whether γ-globin introns or heterologous introns can
increase γ-globin expression.

Another central issue of the invention is to demonstrate that hybrid vectors can
accommodate larger transgenes than rAAV and retroviruses. The insert size limitation of
rAAV and retroviral vectors is 5kb. Transgene cassettes up to 8kb can be inserted into
hybrid vectors as described. The maximal insert size may be about 14kb if hybrid vectors
are produced on the basis of E2a and/or E4 deleted rAd vectors in corresponding packaging
cell lines. The maximal insert size in hybrid vectors is dictated by the packaging limit
of first generation vectors (Ad.AAV) (<36kb), which are necessary intermediates for hybrid
virus production at large scale. It is expected that the stability and titer of Ad.AAV
vectors with an 8kb globin gene cassette are comparable to those of the vectors containing
the 2.5-3.5kb cassettes used in Ad.AAVBG, Ad.AAV1, and Ad.AAVSNori. The following example
experiments address these issues.

Production of ΔAd.AAVfx with large globin expression cassettes:

In order to improve the condition of sickle cell disease, the expression level of the
transferred γ-globin gene must be at least 50% of that of each endogenous beta gene. These
levels of transgene expression can only be achieved by using optimal expression
cassettes, including extended LCRs and intron containing gamma genes (Forrester,
W.C., et al., 1986, Proc. Natl. Acad. Sci. USA 83, 1359-1363; Fraser, P., et al., 1998,
Curr. Opinion in Cell Bio., 10, 361-365; Grosveld, F., et al., 1998, Seminars in
Hematology, 35, 105-111; Martin, D. et al., 1996, Current Opinion in Genetics and
Development, 6:488-95). So far, most γ-globin expression cassettes have been designed
for retroviral and rAAV vectors; they are therefore less than 5kb and must be devoid of
internal splice sites or polyadenylation signals. With the integrating vectors described
herein, it is possible to go beyond this size limitation. This allows one to improve
γ-globin expression in bone marrow cells in terms of an adequate expression level and
long term persistence. For this purpose, γ-globin constructs developed by Li et al.
(Emery, D. W., et al. 1999 Hum Gene Ther 10:877-88; Li, Q., et al. 1999. Blood 93:2208-16)
or by Ellis et al. (Ellis, J., et al., 1996, EMBO J., 15, 562-568; Ellis, J., et al., 1997,
Nucleic Acids Res. 25, 1296-1302) are chosen.

(i) The first cassette contains a γ-globin expression unit used in retroviral vectors.
This allows for a direct comparison between the two systems. This construct includes the
beta promoter from -127 to the beta initiation codon, which is connected in frame with
the gamma coding region. This beta promoter is combined with the 300bp HS40 derived
from the human alpha globin locus, which acts as a strong enhancer for globin
expression. The globin gene is the 1.1kb version with intron 1 and partially deleted intron
2. A second cassette is generated containing HS40, the beta promoter, and the complete
3.3kb gamma globin gene.

(ii) The second construct contains the 6.5kb beta μLCR, which confers a dominant
chromatin opening activity and an adequate level of gamma globin expression in
transgenic mice. The LCR is linked to the short 1.1kb version of the gamma globin gene
or the complete 3.3kb gamma gene.
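
The size figures quoted above can be checked with simple arithmetic. The sketch below is illustrative only: it sums the element sizes that the text states explicitly (300bp HS40, 1.1kb and 3.3kb gamma globin genes, 6.5kb LCR) and compares them with the 5kb limit given for rAAV and retroviral vectors and the 8kb figure given for hybrid vectors. The beta promoter is left out because its length is not stated as a size, so each total is a lower bound. The constant and function names are hypothetical.

```python
# Element sizes in base pairs, as stated in the text.
HS40_BP = 300            # HS40 enhancer from the alpha globin locus
GAMMA_SHORT_BP = 1100    # gamma gene with intron 1 and partially deleted intron 2
GAMMA_FULL_BP = 3300     # complete gamma globin gene
LCR_BP = 6500            # beta globin LCR

# Insert size figures quoted in the text.
RAAV_RETROVIRUS_LIMIT_BP = 5000
HYBRID_VECTOR_LIMIT_BP = 8000


def cassette_size_bp(*elements_bp: int) -> int:
    """Total size of a cassette assembled from the listed elements."""
    return sum(elements_bp)


def fits(size_bp: int, limit_bp: int) -> bool:
    """True if a cassette of this size is within the given insert limit."""
    return size_bp <= limit_bp


# Cassettes named in (i) and (ii); the promoter is omitted (size not stated).
CASSETTES = {
    "HS40 + short gamma gene": cassette_size_bp(HS40_BP, GAMMA_SHORT_BP),
    "HS40 + complete gamma gene": cassette_size_bp(HS40_BP, GAMMA_FULL_BP),
    "LCR + short gamma gene": cassette_size_bp(LCR_BP, GAMMA_SHORT_BP),
    "LCR + complete gamma gene": cassette_size_bp(LCR_BP, GAMMA_FULL_BP),
}

if __name__ == "__main__":
    for name, size in CASSETTES.items():
        print(
            f"{name}: {size} bp, "
            f"within 5kb limit: {fits(size, RAAV_RETROVIRUS_LIMIT_BP)}, "
            f"within 8kb figure: {fits(size, HYBRID_VECTOR_LIMIT_BP)}"
        )
```

Under these figures, both LCR-containing cassettes exceed the 5kb limit, and the LCR combined with the complete gene also exceeds 8kb, which suggests that such a cassette would depend on the larger capacity the text attributes to E2a and/or E4 deleted vectors.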

(iii) Additional globin expression cassettes can be generated that include
insulators, MARs or SARs, as well as other elements that can improve transgene
expression from integrated vectors or in transgenic animals, such as introns derived from
the HPRT or hGH genes (Chung, J.H., et al., 1997, Proc. Natl. Acad. Sci. USA 94, 575-580;
Dunaway, M., et al., 1993, Mol. Cell. Biol., 17, 182-189; Felsenfeld, G., et al., 1996,
Proc. Natl. Acad. Sci. USA 93, 9384-9388; Klehr, D., et al., 1991, Biochemistry, 30,
1264-1270).

Transduction studies with ΔAd.AAVfx-globin vectors:

Transduction studies with globin-hybrid vectors are performed as described earlier
(Steinwaerder, D. S., et al. 1999. J Virol 73:9303-13). Transduced CD34+ cells are
submitted to differentiation in colony assays or analyzed in in vivo expansion assays in
SCID-NOD mice. MC-colonies or bone marrow cells from experimental mice are
analyzed for globin expression. Gamma-globin expression is measured using fluorescent
anti-gamma-globin antibodies. RNase protection studies can be performed to
specifically quantitate gamma globin mRNA in comparison with. -globin RNA. For these
studies around 10^4-10^5 cells are needed per test.
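
The comparison of transgene and endogenous globin RNA can be expressed as a ratio and checked against the requirement, stated earlier, that transferred gene expression reach at least 50% of the endogenous gene. The following is a minimal sketch under stated assumptions: the inputs are band or signal intensities in arbitrary units, already background corrected, and the function names and the default threshold parameter are hypothetical illustrations rather than part of the described assay.

```python
def relative_expression(transgene_signal: float, endogenous_signal: float) -> float:
    """Ratio of transgene signal to endogenous reference signal."""
    if endogenous_signal <= 0:
        raise ValueError("endogenous reference signal must be positive")
    return transgene_signal / endogenous_signal


def meets_threshold(
    transgene_signal: float,
    endogenous_signal: float,
    threshold: float = 0.5,  # the 50% level named in the text
) -> bool:
    """True if the transgene reaches the threshold fraction of the reference."""
    return relative_expression(transgene_signal, endogenous_signal) >= threshold


if __name__ == "__main__":
    # Made-up intensities, for illustration only.
    print(relative_expression(30.0, 60.0))
    print(meets_threshold(30.0, 60.0))
    print(meets_threshold(10.0, 60.0))
```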

Position effects:

In the absence of the LCR, globin genes are subjected to strong position effects when
they are transferred into cultured CD34+ cells or erythroleukemic lines (Fraser, P., et al.,
1998, Curr. Opinion in Cell Bio., 10, 361-365; Grosveld, F., et al., 1998, Seminars in
Hematology, 35, 105-111). Another concern is that site-specific integration of
ΔAd.AAV/rep vectors into AAVS1 may silence transgene expression. If silencing
happens, it can be overcome by incorporating LCRs such as the 6.5kb beta globin μLCR
(Ellis, J., et al., 1996, EMBO J., 15, 562-568; Grosveld, F., et al., 1998, Seminars in
Hematology, 35, 105-111) or insulators into ΔAd.AAV based expression units. Insulators
are DNA elements that protect an integrated reporter gene from chromosomal position
effects or that block enhancer activated transcription from a downstream promoter.
Insulator elements are known for Drosophila melanogaster genes (Gypsy, suppressor of
Hairy wing, scs, scs', Fab-7), for the chicken beta-globin gene (HS4), and for the T cell
receptor (BEAD1; 14, 21.25). Specifically, the Drosophila gypsy or the beta globin
insulator can be inserted as two copies flanking the globin expression cassette into hybrid
vectors. The position effects can be examined in transduced MC-colonies based on the
analysis of integrated vector DNA (see Fig. 29) and gamma-globin mRNA quantification.
Analogous studies can be performed on transduced human bone marrow cells obtained
after transplantation of infected CD34+ cells into SCID-NOD mice.

Intron effects on gamma-globin expression:

A number of reports reveal that the deletion of globin introns, particularly the second
intron of the beta and gamma genes, decreases globin mRNA stability and thus the
expression level (Antoniou, M. et al., 1998, Nucleic Acids Res., 26:721-9). RNA viruses
such as onco-retro, lenti- and foamy viruses are problematic as vehicles for
intron-containing transgenes. Because ΔAd.AAV is a DNA virus, it should package globin
introns and LCRs if necessary without the decreased titers and rearrangements observed
with retroviral vectors.

APPENDIX I

HUMAN AND ANIMAL ADENOVIRUSES AVAILABLE FROM
AMERICAN TYPE CULTURE COLLECTION

1: Adenovirus Type 21 ATCC VR-1099 (NIAID V-221-002-014)
2: SA18 (Simian adenovirus 18) ATCC VR-943 Classification
3: SA17 (Simian adenovirus 17) ATCC VR-942 Classification
4: Adenovirus Type 47 ATCC VR-1309 Classification: Adenov
5: Adenovirus Type 44 ATCC VR-1306 Classification: Adenov
6: Avian adenovirus Type 4 ATCC VR-829 Classification: Ad
7: Avian adenovirus Type 5 ATCC VR-830 Classification: Ad
8: Avian adenovirus Type 7 ATCC VR-832 Classification: Ad
9: Avian adenovirus Type 8 ATCC VR-833 Classification: Ad
10: Avian adenovirus Type 9 ATCC VR-834 Classification: Ad
11: Avian adenovirus Type 10 ATCC VR-835 Classification: A
12: Avian adenovirus Type 2 ATCC VR-827 Classification: Ad
13: Adenovirus Type 45 ATCC VR-1307 Classification: Adenov
14: Adenovirus Type 38 ATCC VR-988 Permit: PHS permit requ
15: Adenovirus Type 46 ATCC VR-1308 Classification: Adenov
16: Simian adenovirus ATCC VR-541 Classification: Adenovir
17: SA7 (Simian adenovirus 16) ATCC VR-941 Classification:
18: Frog adenovirus (FAV-1) ATCC VR-896 Classification: Ad
19: Adenovirus Type 48 (candidate) ATCC VR-1406 Classifica
20: Adenovirus Type 42 ATCC VR-1304 Classification: Adenov
21: Adenovirus type 49 (candidate) ATCC VR-1407 Classifica
22: Adenovirus Type 43 ATCC VR-1305 Classification: Adenov
23: Avian adenovirus Type 6 ATCC VR-831 Permit: USDA permi
24: Avian adenovirus Type 3 (Inclusion body hepatitis virus)
25: Bovine adenovirus Type 3 ATCC VR-639 Classification: A
26: Bovine adenovirus Type 6 ATCC VR-642 Permit: USDA perm
27: Canine adenovirus ATCC VR-800 Classification: Adenovir
28: Bovine adenovirus Type 5 ATCC VR-641 Permit: USDA perm
29: Adenovirus Type 36 ATCC VR-913 Classification: Adenovi
30: Ovine adenovirus type 5 ATCC VR-1343 Classification: A
31: Adenovirus Type 29 ATCC VR-272 Classification: Adenovi
32: Swine adenovirus ATCC VR-359 Classification: Adenoviru
33: Bovine adenovirus Type 4 ATCC VR-640 Permit: USDA perm
34: Bovine adenovirus Type 8 ATCC VR-769 Permit: USDA perm
35: Bovine adenovirus Type 7 ATCC VR-768 Permit: USDA perm
36: Adeno-associated virus Type 2 (AAV-2H) ATCC VR-680 Cla
37: Adenovirus Type 4 ATCC VR-4 Classification: Adenovirus
38: Adeno-associated virus Type 3 (AAV-3H) ATCC VR-681 Cla
39: Peromyscus adenovirus ATCC VR-528 Classification: Aden
40: Adenovirus Type 15 ATCC VR-661 Classification: Adenovi
41: Adenovirus Type 20 ATCC VR-662 Classification: Adenovi
42: Chimpanzee adenovirus ATCC VR-593 Classification: Aden
43: Adenovirus Type 31 ATCC VR-357 Classification: Adenovi
44: Adenovirus Type 25 ATCC VR-223 Classification: Adenovi

45: Chimpanzee adenovirus ATCC VR-592 Classification: Aden
46: Chimpanzee adenovirus ATCC VR-591 Classification: Aden
47: Adenovirus Type 26 ATCC VR-224 Classification: Adenovi
48: Adenovirus Type 19 ATCC VR-254 Classification: Adenovi
49: Adenovirus Type 23 ATCC VR-258 Classification: Adenovi
50: Adenovirus Type 28 ATCC VR-226 Classification: Adenovi
51: Adenovirus Type 6 ATCC VR-6 Classification: Adenovirus
52: Adenovirus Type 2 Antiserum: ATCC VR-1079 AS/Rab (NIA
53: Adenovirus Type 6 ATCC VR-1083 (NIAID V-206-003-014)
54: Ovine adenovirus type 6 ATCC VR-1340 Classification: A
55: Adenovirus Type 3 ATCC VR-847 (Derived from NIAID V-20
56: Adenovirus Type 7 ATCC VR-7 Classification: Adenovirus
57: Adenovirus Type 39 ATCC VR-932 Classification: Adenovi
58: Adenovirus Type 3 ATCC VR-3 Classification: Adenovirus
59: Bovine adenovirus Type 1 ATCC VR-313 Classification: A
60: Adenovirus Type 14 ATCC VR-15 Classification: Adenovir
61: Adenovirus Type 1 ATCC VR-1078 (NIAID V-201-001-014)
62: Adenovirus Type 21 ATCC VR-256 Classification: Adenovi
63: Adenovirus Type 18 ATCC VR-1095 (NIAID V-218-003-014)
64: Baboon adenovirus ATCC VR-275 Classification: Adenovir
65: Adenovirus Type 10 ATCC VR-11 Classification: Adenovir
66: Adenovirus Type 33 ATCC VR-626 Classification: Adenovi
67: Adenovirus Type 34 ATCC VR-716 Classification: Adenovi
68: Adenovirus Type 15 ATCC VR-16 Classification: Adenovir
69: Adenovirus Type 22 ATCC VR-257 Classification: Adenovi
70: Adenovirus Type 24 ATCC VR-259 Classification: Adenovi
71: Adenovirus Type 17 ATCC VR-1094 (NIAID V-217-003-014)
72: Adenovirus Type 4 ATCC VR-1081 (NIAID V-204-003-014)
73: Adenovirus Type 16 ATCC VR-17 Classification: Adenovir
74: Adenovirus Type 17 ATCC VR-18 Classification: Adenovir
75: Adenovirus Type 16 ATCC VR-1093 (NIAID V-216-003-014)
76: Bovine adenovirus Type 2 ATCC VR-314 Classification: A
77: SV-30 ATCC VR-203 Classification: Adenovirus, Simian (
78: Adenovirus Type 32 ATCC VR-625 Classification: Adenovi
79: Adenovirus Type 20 ATCC VR-255 Classification: Adenovi
80: Adenovirus Type 13 ATCC VR-14 Classification: Adenovir
81: Adenovirus Type 14 ATCC VR-1091 (NIAID V-214-001-014)
82: Adenovirus Type 18 ATCC VR-19 Classification: Adenovir
83: SV-39 ATCC VR-353 Classification: Adenovirus, Simian (
84: Adenovirus Type 11 ATCC VR-849 (Derived from NIAID V-2
85: Duck adenovirus (Egg drop syndrome) ATCC VR-921 Permi
86: Adenovirus Type 1 ATCC VR-1 Classification: Adenovirus
87: Chimpanzee adenovirus ATCC VR-594 Classification: Aden
88: Adenovirus Type 15 ATCC VR-1092 (NIAID V-215-003-014)
89: Adenovirus Type 13 ATCC VR-1090 (NIAID V-213-003-014)
90: Adenovirus Type 8 ATCC VR-1368 (Derived from NIAID V-20
91: SV-31 ATCC VR-204 Classification: Adenovirus, Simian (
92: Adenovirus Type 9 ATCC VR-1086 (NIAID V-209-003-014)
93: Mouse adenovirus ATCC VR-550 Classification: Adenoviru
94: Adenovirus Type 9 ATCC VR-10 Classification: Adenoviru
95: Adenovirus Type 41 ATCC VR-930 Classification: Adenovi
96: C1 ATCC VR-20 Classification: Adenovirus, Simian (Mast
97: Adenovirus Type 40 ATCC VR-931 Classification: Adenovi
98: Adenovirus Type 37 ATCC VR-929 Classification: Adenovi
99: Marble spleen disease virus (Hemorrhagic enteritis virus
100: Adenovirus Type 35 ATCC VR-718 Classification: Adenovi
101: SV-32 (M3) ATCC VR-205 Classification: Adenovirus, Sim
102: Adenovirus Type 28 ATCC VR-1106 (NIAID V-228-003-014)
103: Adenovirus Type 10 ATCC VR-1087 (NIAID V-210-003-014)
104: Adenovirus Type 20 ATCC VR-1097 (NIAID V-220-003-014)
105: Adenovirus Type 21 ATCC VR-1098 (NIAID V-221-011-014)
106: Adenovirus Type 25 ATCC VR-1103 (NIAID V-225-003-014)
107: Adenovirus Type 26 ATCC VR-1104 (NIAID V-226-003-014)
108: Adenovirus Type 31 ATCC VR-1109 (NIAID V-231-001-014)
109: Adenovirus Type 19 ATCC VR-1096 (NIAID V-219-002-014)
110: SV-36 ATCC VR-208 Classification: Adenovirus, Simian (
111: SV-38 ATCC VR-355 Classification: Adenovirus, Simian (
112: SV-25 (M8) ATCC VR-201 Classification: Adenovirus, Sim
113: SV-15 (M4) ATCC VR-197 Classification: Adenovirus, Sim
114: Adenovirus Type 22 ATCC VR-1100 (NIAID V-222-003-014)
115: SV-23 (M2) ATCC VR-200 Classification: Adenovirus, Sim
116: Adenovirus Type 11 ATCC VR-12 Classification: Adenovir
117: Adenovirus Type 24 ATCC VR-1102 (NIAID V-224-003-014)
118: Avian adenovirus Type 1 (Chicken Embryo Lethal Orphan: C
119: SV-11 (M5) ATCC VR-196 Classification: Adenovirus, Sim
120: Adenovirus Type 5 ATCC VR-5 Classification: Adenovirus
121: Adenovirus Type 23 ATCC VR-1101 (NIAID V-223-003-014)
122: SV-27 (M9) ATCC VR-202 Classification: Adenovirus, Sim
123: Avian adenovirus Type 2 (GAL) ATCC VR-280 Classificati
124: SV-1 (M1) ATCC VR-195 Classification: Adenovirus, Simi
125: SV-17 (M6) ATCC VR-198 Classification: Adenovirus, Sim
126: Adenovirus Type 29 ATCC VR-1107 (NIAID V-229-003-014)
127: Adenovirus Type 2 ATCC VR-846 Classification: Adenovir
128: SV-34 ATCC VR-207 Classification: Adenovirus, Simian (
129: SV-20 (M7) ATCC VR-199 Classification: Adenovirus, Sim
130: SV-37 ATCC VR-209 Classification: Adenovirus, Simian (
131: SV-33 (M10) ATCC VR-206 Classification: Adenovirus, Si






132: Avian adeno-associated virus ATCC VR-865 Classificatio
133: Adeno-associated (satellite) virus Type 4 ATCC VR-646
134: Adenovirus Type 30 ATCC VR-273 Classification: Adenovi
135: Adeno-associated (satellite) virus Type 1 ATCC VR-645
136: Infectious canine hepatitis (Rubarth's disease, Fox ence
137: Adenovirus Type 27 ATCC VR-1105 (NIAID V-227-003-014)
138: Adenovirus Type 12 ATCC VR-863 (Derived from NIAID V-2
139: Adeno-associated virus Type 2 (molecularly cloned) ATCC
140: Adenovirus Type 7a ATCC VR-848 (Derived from NIAID V-2



What is claimed:

1. A first generation recombinant adenovirus vector a portion of which integrates
into a host cell genome, comprising:

a) A left adenovirus inverted terminal repeat sequence;

b) An adenoviral packaging sequence 3' to the left adenovirus inverted
terminal repeat sequence;

c) A transgene cassette sequence 3' to the adenoviral packaging sequence;

d) At least one adenoviral sequence which directs adenoviral replication; and

e) A right adenoviral inverted terminal repeat sequence,

wherein the left and right terminal repeat sequences permit integration of the
transgene cassette sequence into the host cell genome.

2. The adenovirus vector of claim 1, wherein the left and right adenovirus inverted
repeat sequence and the packaging sequence are from the same adenoviral
serotype.

3. The adenovirus vector of claim 1, wherein the sequence which directs adenoviral
replication comprises a sequence on the anti-parallel strand which encodes an
adenoviral fiber protein including a fiber tail, a fiber shaft, and a fiber knob,
wherein the fiber knob includes a G-H loop region.

4. The adenovirus vector of claim 3, wherein the sequence on the anti-sense strand
which encodes the fiber tail is from the same serotype as the adenoviral inverted
repeat sequence.

5. The adenovirus vector of claim 1, wherein the transgene cassette sequence
comprises:

a) A left cassette inverted terminal repeat sequence;

b) A polyadenylation sequence 3' to the left inverted terminal repeat
sequence;

c) A transgene sequence 3' to the polyadenylation sequence;

d) A promoter sequence 3' to the polyadenylation sequence; and

e) A right cassette inverted terminal repeat sequence 3' to the polyadenylation
sequence.

6. The adenovirus vector of claim 1, wherein the transgene cassette sequence
comprises:

a) A left cassette inverted terminal repeat sequence;

b) A promoter sequence 3' to the left inverted terminal repeat sequence;

c) A transgene sequence 3' to the promoter sequence;

d) A polyadenylation sequence 3' to the transgene sequence; and

e) A right cassette inverted terminal repeat sequence 3' to the polyadenylation
sequence.

7. The adenovirus vector of claim 5 or 6, wherein the left and the right cassette
inverted terminal repeat sequences each comprise an adenoviral-associated
inverted terminal repeat sequence.

8. The adenovirus vector of claim 5 or 6, wherein the transgene sequence is selected
from a group consisting of a therapeutic gene, a selectable gene, and a reporter
gene.

9. The adenovirus vector of claim 8, wherein the therapeutic gene is selected from a
group consisting of gamma globin, and human alpha-1 anti-trypsin.

10. The adenovirus vector of claim 8, wherein the selectable gene is selected from a
group consisting of neomycin, ampicillin, penicillin, tetracycline, and gentamycin.

11. The adenovirus vector of claim 8, wherein the reporter gene is selected from a
group consisting of green fluorescent protein, beta galactosidase, alkaline
phosphatase.






12. The transgene cassette of claim 5 or 6 further comprising an inverted repeat 
sequence located 3' to the left inverted terminal repeat sequence or located 5' to 
the right inverted terminal repeat sequence. 

13. The transgene cassette of claim 5 or 6, further comprising an insulator element. 

14. The transgene cassette of claim 5 or 6, further comprising a bacterial origin of 
replication. 

15. The adenoviral vector of claim 1, wherein the adenoviral sequences which direct
adenoviral replication are selected from a group consisting of E2 and E4; E1, E2
and E4; E2 and E4; and E2, E3, and E4.

16. A first generation recombinant adenovirus vector which targets a host cell of
interest and a portion of which integrates into the host cell genome so targeted,
comprising two DNA strands, each strand being antiparallel to the other, the first
strand comprising:

a) A left adenovirus inverted terminal repeat sequence;

b) An adenoviral packaging sequence 3' to the left adenovirus inverted
terminal repeat sequence;

c) A transgene cassette sequence 3' to the adenoviral packaging sequence;

d) At least one adenoviral sequence which directs adenoviral replication; and

e) A right adenoviral inverted terminal repeat sequence,

wherein the left and right terminal repeat sequences permit integration of the
transgene cassette sequence into the host cell genome, and wherein the second
strand comprises a sequence which encodes an adenoviral fiber protein that
permits targeting of the vector into the host cell of interest.

17. The adenovirus vector of claim 16, wherein the adenoviral protein includes a fiber
tail, a fiber shaft, and a fiber knob, wherein the fiber knob includes a G-H loop
region.

18. The adenovirus vector of claim 16, wherein the left and right adenovirus inverted
terminal repeat sequences and the packaging sequence are from the same
adenoviral serotype.

19. The adenovirus vector of claim 17, wherein the fiber tail is from the same
serotype as the left and right adenoviral inverted repeat sequences.

20. The adenovirus vector of claim 17, wherein the fiber shaft is from a different
serotype as the left and right adenoviral inverted repeat sequences.

21. The adenovirus vector of claim 20, wherein the fiber shaft is from a serotype
selected from a group consisting of serotype 3, 7, 9, 11, and 35.

22. The adenovirus vector of claim 17, wherein the fiber shaft comprises a shortened
length.

23. The adenovirus vector of claim 17, wherein the fiber knob is from a different
serotype as the left and right adenoviral inverted repeat sequences.

24. The adenovirus vector of claim 23, wherein the fiber knob is from a serotype
selected from a group consisting of serotype 3, 7, 9, 11, and 35.

25. The adenovirus vector of claim 17, wherein the fiber knob is a modified fiber
knob protein comprising the G-H loop replaced with a heterologous peptide
ligand sequence which binds to at least one surface protein on the host cell of
interest.

26. The adenovirus vector of claim 16, wherein the transgene cassette sequence
comprises:

a) A left cassette inverted terminal repeat sequence;

b) A polyadenylation sequence 3' to the left inverted terminal repeat
sequence;

c) A transgene sequence 3' to the polyadenylation sequence;

d) A promoter sequence 3' to the polyadenylation sequence; and

e) A right cassette inverted terminal repeat sequence 3' to the polyadenylation
sequence.

27. The adenovirus vector of claim 16, wherein the transgene cassette sequence
comprises:

a) A left cassette inverted terminal repeat sequence;

b) A promoter sequence 3' to the left inverted terminal repeat sequence;

c) A transgene sequence 3' to the promoter sequence;

d) A polyadenylation sequence 3' to the transgene sequence; and

e) A right cassette inverted terminal repeat sequence 3' to the polyadenylation
sequence.

28. The adenovirus vector of claim 26 or 27, wherein the left and the right cassette 
inverted terminal repeat sequences each comprise an adenoviral-associated 
inverted terminal repeat sequence. 

29. The adenovirus vector of claim 26 or 27, wherein the transgene sequence is 
selected from a group consisting of a therapeutic gene, a selectable gene, and a 
reporter gene. 

30. The adenovirus vector of claim 24, wherein the therapeutic gene is selected from
a group consisting of gamma globin, and human alpha-1 anti-trypsin.

31. The adenovirus vector of claim 24, wherein the selectable gene is selected from a
group consisting of neomycin, ampicillin, penicillin, tetracycline, and gentamycin.

32. The adenovirus vector of claim 24, wherein the reporter gene is selected from a
group consisting of green fluorescent protein, beta galactosidase, alkaline
phosphatase.

33. The transgene cassette of claim 26 or 27 further comprising an inverted repeat
sequence located 3' to the left inverted terminal repeat sequence or located 5' to
the right inverted terminal repeat sequence.

34. The transgene cassette of claim 26 or 27, further comprising an insulator element.

35. The transgene cassette of claim 26 or 27, further comprising a bacterial origin of
replication.

36. The adenoviral vector of claim 16, wherein the adenoviral sequences which direct
adenoviral replication are selected from a group consisting of E2 and E4; E1, E2
and E4; E2 and E4; and E2, E3, and E4.

37. A recombinant gutless adenovirus vector a portion of which integrates into a host
cell genome, comprising:

a) A left adenovirus inverted terminal repeat sequence;

b) An adenoviral packaging sequence 3' to the left adenovirus inverted
terminal repeat sequence;

c) A transgene cassette sequence 3' to the adenoviral packaging sequence;
and

d) A right adenoviral inverted terminal repeat sequence,

wherein the left and right terminal repeat sequences permit integration of the
transgene cassette sequence into the host cell genome.

38. The adenovirus vector of claim 36, wherein the left and right adenovirus inverted
repeat sequence and the packaging sequence are from the same adenoviral
serotype, (this is the base vector)

39. The adenovirus vector of claim 36, wherein the transgene cassette sequence
comprises:

a) A left cassette inverted terminal repeat sequence;

b) A polyadenylation sequence 3' to the left inverted terminal repeat
sequence;

c) A transgene sequence 3' to the polyadenylation sequence;

d) A promoter sequence 3' to the polyadenylation sequence; and

e) A right cassette inverted terminal repeat sequence 3' to the polyadenylation
sequence.

40. The adenovirus vector of claim 36, wherein the transgene cassette sequence
comprises:

b) A left cassette inverted terminal repeat sequence;

c) A promoter sequence 3' to the left inverted terminal repeat sequence;

d) A transgene sequence 3' to the promoter sequence;

e) A polyadenylation sequence 3' to the transgene sequence; and

f) A right cassette inverted terminal repeat sequence 3' to the polyadenylation
sequence.

41. The adenovirus vector of claim 38 or 39, wherein the left and the right inverted
terminal repeat sequences each comprise an adenoviral-associated inverted
terminal repeat sequence.

42. The adenovirus vector of claim 38 or 39, wherein the transgene sequence is 
selected from a group consisting of a therapeutic gene, a selectable gene, and a 

reporter gene.

43. The adenovirus vector of claim 41, wherein the therapeutic gene is selected from
a group consisting of gamma globin and human alpha-1 antitrypsin.

44. The adenovirus vector of claim 41, wherein the selectable gene is selected from a
group consisting of neomycin, ampicillin, penicillin, tetracycline, and gentamicin.

45. The adenovirus vector of claim 41, wherein the reporter gene is selected from a
group consisting of green fluorescent protein, beta galactosidase, and alkaline
phosphatase.

46. The adenovirus vector of claim 38 or 39 further comprising an inverted repeat
sequence located 3' to the left inverted terminal repeat sequence or located 5' to
the right inverted terminal repeat sequence.

47. The adenovirus vector of claim 38 or 39, further comprising an insulator element.

48. The adenovirus vector of claim 38 or 39, further comprising a bacterial origin of
replication.

49. The adenovirus vector of claim 1, wherein the transgene cassette sequence
comprises a 5' portion of a gene of interest.

50. The adenovirus vector of claim 1, wherein the transgene cassette sequence
comprises a 3' portion of a gene of interest.

51. A method of producing a resolved gutless adenovirus vector in a suitable cell, said
method comprising introducing a first and a second adenovirus vector of claim 1
or 16 into the cell under suitable conditions so that the recombinant adenovirus
vectors undergo homologous recombination, thereby producing a resolved gutless
adenovirus vector.

52. A resolved gutless adenovirus vector produced by the method of claim 50.

53. The method of claim 50, wherein the first adenovirus vector comprises a
transgene cassette having a 5' portion of a gene of interest, and wherein the
second adenovirus vector comprises a transgene cassette having a 3' portion of
the gene of interest, and wherein a part of the 5' portion overlaps with a part of
the 3' portion so that homologous recombination occurs.

54. A method of producing a resolved gutless recombinant Ad vector by homologous
recombination in a suitable cell, said method comprising contacting two parental
recombinant Ad vectors, each comprising a transgene cassette containing a
portion of a selected transgene with a region of overlapping homology, so that the
first and second parental recombinant Ad vectors undergo homologous
recombination at the region of overlapping homology, resulting in a resolved
recombinant gutless Ad vector having both portions of the selected transgene, and
wherein the selected transgene is within a transgene cassette flanked by a pair of
ITRs.

55. A resolved gutless adenovirus vector produced by the method of claim 53.

56. An adenovirus library comprising a plurality of adenovirus vectors expressing
fiber proteins which are displayed and modified by random peptide insertions.

57. The library of claim 55, wherein said fiber protein so displayed comprises a
random peptide substituted in the G-H loop of the fiber protein knob domain.

58. A screening method for targeting adenovirus vectors for gene therapy comprising
contacting the adenovirus library of claim 55 with a plurality of cells so that the
cells are transduced with the adenovirus vectors of the adenovirus library,
and detecting the cells so transduced.

59. The adenovirus vector of claim 5, 6, 26, 27, 38, or 39 further comprising a
nucleotide sequence encoding a rep78 protein.

RECOMBINANT ADENOVIRAL VECTORS EXPRESSING CHIMERIC FIBER PROTEINS FOR CELL SPECIFIC
INFECTION AND GENOME INTEGRATION

This invention was made, at least in part, with funding from the National Institutes of 
Health (Grant Nos. R01 CA 80192-01 and R21 DK 55590-01). Accordingly, the United 
States Government has certain rights in this invention.

FIELD OF THE INVENTION 

This invention relates to the field of gene therapy, and in particular, to novel adenovirus 
(Ad) vectors that selectively infect cells for gene therapy, and to Ad vectors containing
modifications of the fiber protein to allow retargeting of any adenovirus serotype. 

BACKGROUND OF THE INVENTION 

Gene transfer vectors require the efficient transduction of target cells, stable association
with the host genome, and adequate transgene expression in the appropriate target cell,
without associated toxic or immunological side effects. Currently available viral vector
systems, including recombinant retroviruses, adenoviruses and adeno-associated viruses,
are not suitable for efficient gene transfer into many cell types. Retroviral vectors require
cell division for stable integration. Recombinant adenoviruses are not able to infect many
cell types important for gene therapy, including hematopoietic stem cells, monocytes, T- 
and B-lymphocytes. Moreover, recombinant adeno-associated vectors (AAV) integrate 
with a low frequency. 

First generation adenoviruses have a number of properties that make them an attractive
vehicle for gene transfer (Hitt, M.M. et al. 1997 Advances in Pharmacology 40:137-205).
These include the ability to produce purified virus at high titers in concert with highly
efficient gene transfer of up to 8 kb long expression cassettes into a large variety of cell
types in vivo, including non-dividing cells. Limitations of first generation adenoviruses
include the development of immune responses to expressed viral proteins resulting in
toxicity and virus clearance. The episomal status of adenoviral DNA within transduced
cells is another limitation of first generation Ad vectors. Stable integration of adenovirus
DNA into the host genome is reported only for wild-type forms of specific subtypes and
appears not to occur in a detectable manner with E1/E3-deleted Ad 5 (adenovirus
serotype 5) vectors widely used for gene transfer in vitro and in vivo [Hitt, M.M. et al.
1997 Advances in Pharmacology 40:137-205].

Recombinant AAV vectors (rAAV) integrate with a low frequency (about 1 out of 20,000
genomes) randomly as concatemers into the host genome (Rutledge, E.A.; Russell, D.W.
1997 J. Virology, 71, 8429-8436). The presence of two AAV inverted terminal repeats
(ITRs) and as yet unknown host cellular factors seem to be the only requirements for
vector integration (Xiao, X., et al., 1997, J. Virology, 71, 941-948; Balague, C., et al.
1997, J. Virology, 71, 3299-3306; Yang, C.C. 1997, J. Virology, 71, 9231-9247). In the
presence of the large AAV Rep proteins, AAV integrates preferentially into a specific site
at human chromosome 19, called AAVS1 (Berns, K.I., 1996, Fields Virology, Fields,
B.N. et al. (ed) Vol. 2, Lippincott-Raven, Philadelphia, PA, 2173-2220). The AAV
capsid is formed by three coat proteins (VP 1-3), which interact with specific heparan
sulfates on the cell surface and probably with specific receptor(s). However, many cell
types, including hematopoietic stem cells, lack these structures so that rAAV vectors
based on AAV2 cannot infect or transduce these cells (Malik P. et al., 1997, J. Virology,
71, 1776-1783; Quing, K.Y., et al. 1998, J. Virology, 72, 1593-1599). Other
disadvantages of rAAV vectors include the limited insert size (4.5-5kb) that can be
accommodated in rAAV vectors lacking all viral genes and low transducing titers of
rAAV preparations. 
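
To make the quoted rAAV integration frequency (about 1 out of 20,000 genomes) concrete, the following is a minimal arithmetic sketch in Python. It assumes, purely for illustration, that every vector genome integrates independently with the same probability; the function names and the example dose of 2000 genomes per cell are hypothetical and are not taken from the cited studies.

```python
# Illustrative arithmetic only: assumes independent integration events
# at a fixed frequency of 1 event per `genomes_per_event` vector genomes.

def expected_integrations(genomes_per_cell: float, cells: int,
                          genomes_per_event: int = 20_000) -> float:
    """Expected number of integration events in a whole cell population."""
    if genomes_per_cell < 0 or cells < 0 or genomes_per_event <= 0:
        raise ValueError("inputs must be non-negative and the frequency positive")
    return genomes_per_cell * cells / genomes_per_event


def fraction_of_cells_with_integration(genomes_per_cell: int,
                                       genomes_per_event: int = 20_000) -> float:
    """Probability that a single cell carries at least one integrated genome."""
    p_single = 1.0 / genomes_per_event
    return 1.0 - (1.0 - p_single) ** genomes_per_cell


if __name__ == "__main__":
    # Hypothetical example: 100,000 cells, 2000 genomes per cell.
    print(expected_integrations(2000, 100_000))
    print(round(fraction_of_cells_with_integration(2000), 4))
```

Under these assumptions, even a high genome dose leaves most cells without an integrated copy, which is the limitation the paragraph above describes.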

Adenovirus infection is initiated by attachment of Ad 5 to the cell surface via its fiber
protein (for a review, see Shenk, T. 1996 Fields Virology, Fields, B.N. et al. (ed)
Vol. 2, Lippincott-Raven, Philadelphia, PA, 2111-2148). The distal, C-terminal domain
of the trimeric fiber molecule terminates in a knob, which binds to a specific cellular
receptor identified recently as the coxsackie-adenovirus receptor (CAR) (Bergelson, J.M.
et al. Science, 275, 1320-1323). After binding, in an event independent of virus
attachment, Arg-Gly-Asp (RGD) motifs in the penton base interact with cellular integrins
of the α3 and β5 types. This interaction triggers cellular internalization whereby the
virion achieves localization within the endosome. The endosomal membrane is lysed in a
process mediated by the penton base, releasing the contents of the endosome to the
cytoplasm. During these processes, the virion is gradually uncoated and the adenoviral
DNA is transported to the nucleus where replication takes place. The terminal protein,
which is covalently attached to the viral genome, and the core protein V, which is localized
on the surface of the cores, have nuclear localization signals (NLSs) (van der Vliet, B.
1995, The Molecular Repertoire of Adenoviruses, Vol. 2, Doerfler, W. and Boehm, P. (ed.),
Springer Verlag, Berlin, 1-31). These NLSs play a crucial role in directing the adenoviral
genome to the nucleus and probably represent the structural elements which allow
adenovirus to transduce non-dividing cells. When the double-stranded, linear DNA
reaches the nucleus, it binds to the nuclear matrix through its terminal protein.

Since the cell types that can be infected with Ad5 or Ad2 vectors are restricted by the
presence of CAR and specific integrins, attempts have been made to widen the tropism of
Ad vectors. Genetic modification of adenovirus coat proteins to target novel cell surface
receptors has been reported for the fiber (Krasnykh, V. et al. 1998 J. Virology, 72, 1844-
1852, Krasnykh, V. et al. 1996 J. Virology, 70, 6839-6846, Stevenson, S.D., et al. 1997,
J. Virology, 71, 4782-4790), penton base (Wickham, T.J., et al. 1996, J. Virology, 70,
6831-6838; Wickham, T.J., et al. 1995, Gene Therapy, 69, 750-756), and hexon proteins
(Crompton, J., et al. 1994, J. Gen. Virol. 75, 133-139). The most promising modification
seems to be the functional modification of the fiber protein or, more specifically, of the
fiber knob as the moiety that mediates the primary attachment. Two groups have
reported the generation of fibers consisting of the Ad5 tail/shaft and the knob domain of
Ad3 (Krasnykh, V. et al. 1996 supra, Stevenson, S.D., et al. 1997, supra). Recently,
recombinant adenoviruses with fibers containing C-terminal poly-lysine, gastrin-releasing
peptide, somatostatin, E-selectin-binding peptide, or oligo-histidines were produced in
order to change the native tropism of Ad5. Krasnykh et al. found (Krasnykh, V. et al.
1998 supra) that heterologous peptide ligands could be inserted into the HI loop of the
fiber knob domain without affecting the biological function of the fiber. Based on studies
with other Ad serotypes, it appears that the length of the fiber shaft is a critical element,
determining the efficiency of interaction with cell surface integrins and the internalization
process. Thus far, there are no reported data demonstrating successful retargeting of Ad5
vectors for a specific cell type.

Therefore, there is a present need for an improved adenovirus vector which can be
targeted efficiently to a variety of cell types and tissues and remain stably integrated in
the host genome with minimal antigenicity to the host. The present invention discloses
novel chimeric adenoviral (Ad) Ad-AAV vectors, which express a modified fiber protein
on their capsid, for specifically targeting the vector. Methods of making, uses and
advantages of these vectors are described. In addition, the alterations described for the
knob and shaft domains of the fiber protein provide a novel approach to retarget any
adenovirus serotype for cell specific infection.

SUMMARY OF THE INVENTION 

The present invention provides for novel chimeric Ad vectors carrying transgenes, or
portions of transgenes, for stable and efficient gene transfer into diverse cell types or
tissues in a CAR- and/or αvβ3/5-independent manner. Also provided are methods for
producing such vectors and the use thereof for gene therapy to target a specific cell type
or tissue.

The recombinant adenovirus vectors of the invention (Example I) provide a novel design
that allows for the easy production and delivery of a "gutless" adenoviral vector with the
added advantage of stable integration of the transgene into the host genome of different
cell types. The adenoviral vector described is devoid of all adenoviral sequences except
for the 5' and 3' cis elements necessary for replication and virion encapsidation. The
adeno-associated virus sequences of the invention comprising the 5' (right) and 3'
(left) inverted terminal repeats (ITRs) flank the transgene cassette such that they
direct homologous recombination during viral replication and viral integration into the
host genome. In one embodiment AAV-ITR flanking sequences are used. The vector
also contains a selected transgene(s) operably linked to a selected regulatory element and
a polyadenylation stop signal, which is in turn flanked by the flanking sequences described
above. The selected transgene(s) can be linked under the same regulatory elements or
under separate regulatory elements in the same orientation or in opposite orientations
with respect to each other. The selected transgene(s) are any gene or genes which are
expressed in a host cell or tissue for therapeutic, reporter or selection purposes. This
vector is characterized by high titer transgene delivery to a host cell and the ability to
stably integrate the transgene into the host genome. Also provided is a method to
improve the integration frequency and site specific integration by incorporating an AAV
rep protein into the recombinant hybrid vector.

The invention also provides chimeric fiber proteins (Example II), which include
naturally occurring fiber proteins in which a portion or portions of the sequence are
modified to alter cell or tissue specificity of infection. Altered fiber protein sequences
can include fiber protein domains (the knob domain, the shaft domain, and the tail
domain) from other or the same adenovirus serotypes or from randomly selected
peptides. A chimeric fiber protein can be entirely composed of non-naturally occurring
sequences. The invention further relates to nucleic acid sequences encoding the chimeric
fiber proteins. These nucleic acid sequences can be naturally occurring, a mixture of
naturally occurring and non-naturally occurring sequences, or entirely non-naturally
occurring sequences.

The heterologous fiber protein sequences described herein can be inserted into any
adenovirus based vector which contains a capsid, rendering the virus capable of
specifically infecting a given cell or tissue. Adenoviral vectors having such a
heterologous fiber sequence can be used to direct gene transfer into desired cells. For
stable integration of the transgene cassette into the host genome, the chimeric Ad-AAV
vector described in the invention is the preferred vector of use.

The invention also includes a library of adenoviruses displaying random peptides in their
fiber knobs, which can be used as ligands to screen for an adenovirus variant with tropism to a
particular cell type in vitro and in vivo.

The chimeric Ad vectors described herein include the Ad.AAV genome with a modified
fiber protein expressed on its capsid. These chimeric vectors are designed to infect a
wide variety of cells, in particular, the cells which can only be poorly transduced by the
commonly used retroviral, AAV and adenoviral vectors. These cells include, but are not
limited to, hematopoietic stem cells, lung epithelial cells, dendritic cells, lymphoblastoid
cells, and endothelial cells. Hematopoietic stem cells such as CD34+ cells can be targeted
for gene therapy of sickle cell anemia and thalassemia using the vector described herein.
The chimeric Ad-AAV vector capable of transducing genes into endothelial cells can be
used in gene therapy for vascular diseases such as atherosclerosis or restenosis after
coronary artery surgery.

BRIEF DESCRIPTION OF THE FIGURES 

Figs. 1A-1C display a proposed mechanism for formation of the ΔAd.AAV1 genome.

Figs. 2A and 2B show electron photomicrographs of hybrid virus particles: Fig. 2A
shows Ad.AAV1 and Fig. 2B shows ΔAd.AAV1.

Fig. 3 illustrates analysis of ΔAd.AAV1 genomes after transduction of SKHep1 cells.
Pulsed-field gel electrophoresis (PFGE). 1x10^6 control SKHep1 cells (SKHep1) (lanes 1-3,
5, 9), SKHep1 cells from G418 resistant pools (ΔAd.AAV1) (infected with ΔAd.AAV1
and selected for 4 weeks) (lanes 6-8, 10-12), or SKHep1 cells collected at 3 days after
infection with 2000 genomes Ad.AAV1 (Ad) (lanes 4, 13) are sealed in agarose plaques,
lysed in situ and subjected to PFGE with or without prior digestion with restriction
endonucleases. Southern Blot is performed with a SEAP specific probe. U = undigested,
P = digested with PI-SceI, I = I-CeuI, E = EcoRI.

Figs. 4A and 4B show response of K562 and CD34+ cells respectively after infection
with ΔAd.AAVBG. Cells are incubated for 6 hours with virus under agitation. At day 3
after infection, transduction frequency is calculated based on the number of X-Gal
positive cells. Viability is tested by trypan blue exclusion. N = 3, SEM < 10%.

Fig. 5 shows Rep expression in SKHep1 and 293 cells after plasmid transfection. 5x10^5
cells are transfected with pAAV/Ad, pRSVrep, or pPGKrep by Ca-phosphate co-
precipitation. Three days after transfection, cells are harvested. Lysates are separated on
a 10% PA gel, followed by Western Blot with Rep specific antibodies (03-65169,
American Research Products), and developed with ECL (Amersham).

Fig. 6 shows detection of vector integration into AAVS1 by PFGE. 

Fig. 7 shows strategy for creating a ΔAd.AAV hybrid vector capable of site-specific
integration. Arrows indicate promoters. (PA) = polyadenylation signal. Ψ = adenoviral
packaging signal.

Figs. 8A-8B show vectors for transduction studies with SNori as expression unit and
analysis of vector integration on genomic DNA from a small cell number. Analogous
vector sets can be generated with β-galactosidase (BG) or green fluorescent protein
(GFP) as reporter genes.

Fig. 9 shows strategy for substituting the Ad5 fiber sequence by the heterologous fiber X
genes using recombination in E. coli.

Fig. 10 shows the expression of CAR and αv-integrins on test cells. For flow cytometry
analysis, HeLa, CHO, K562, and CD34+ cells were incubated with monoclonal anti-CAR
(RmcB, 1:400 dilution) or anti-αv-integrin antibodies (L230, 1:30 dilution). As a negative
control, cells were incubated with an irrelevant mouse monoclonal antibody (anti-BrdU,
1:100 dilution). The binding of primary antibodies was developed with anti-mouse IgG-
FITC labeled conjugates (1:100 dilution). Data shown represent the average results of
quadruplicate analyses performed on 10^4 cells.

Fig. 11 shows the electron microscopy of adenovirus particles. Purified particles from
Ad5, 9, and 35 were negative contrast stained and analyzed at a magnification of
85,000x. Defective particles are highlighted by arrows.

Fig. 12 shows the analysis of attachment and internalization of different serotypes to
CHO, HeLa, K562, and CD34+ cells. Equal amounts of [3H]-thymidine-labeled virions
of Ads 3, 4, 5, 9, 35, and 41 (measured by OD260, and equivalent to an MOI of 400 pfu
per cell for Ad5) were incubated for one hour on ice as described in Materials and
Methods. Cells were then washed, and the number of labeled virions bound per cell was
determined. For internalization studies, viruses were first allowed to attach to cells for 1 h
on ice. Then, unbound viral particles were washed out. Cells were then incubated at 37°C
for 30 min followed by treatment with trypsin-EDTA and washing to remove
uninternalized viral particles. The data were obtained from two to four independent
experiments performed in triplicate. Note the different scale on the Y-axes for CD34+
cells.

Figs. 13A-13C show attachment and internalization of different adenovirus serotypes to
HeLa, CHO and 293 cells respectively.

Figs. 14A and 14B show attachment and internalization of different adenovirus serotypes
to CD34+ and K-562 cells respectively.

Fig. 15A-15C shows the analysis of viral replication in K562 and CD34+ cells by
Southern blot analysis of methylated viral DNA. Replication studies were performed
with 1x10^5 K562 cells (A) or CD34+ cells (B), infected with methylated Ad5, Ad9 or
Ad35. The lane labeled as "load" represents DNA that was extracted from the media/cell
mixture immediately after adding the indicated viral dose to cells. The intensities of
bands corresponding to methylated and un-methylated viral DNA indicate that ~85% of
the input virus was methylated. To quantify adsorption and internalization, DNA analysis
was performed after prior incubation of virus with cells at 0°C (adsorption) or 37°C
(internalization). For dose dependent replication studies, the indicated viral dose
(expressed as the number of genomes) was added to the cells, and cellular genomic DNA
together with viral DNA was extracted 16 hours or 36 hours post-infection for K562 and
CD34+ cells, respectively. Identical amounts of sample DNA were analyzed by Southern
blot. For quantification purposes, Ad9 replication was analyzed together with Ad5 using
an Ad5/9 chimeric probe that hybridizes with DNA of both serotypes (C). The analysis of
Ad5 versus Ad35 replication was performed with the corresponding Ad5/35 chimeric
probe. Since separate hybridizations with both Ad5/35 and Ad5/9 probes gave identical
signal intensities for Ad5 DNA, only one panel is shown for Ad5 replication in test cells.
To produce distinguishable fragments specific for the methylated or non-methylated
status of viral genomes, Ad5 DNA was digested with Xho I, while Ad9 and Ad35 DNA
was digested with Xho I and Hind III. The bands specific for methylated (not-replicated)
viral DNA were ~12kb for Ad9, 35kb for Ad5, and ~12kb for Ad35. The fragments
specific for non-methylated DNA were 5.8kb for Ad9, 6.1kb for Ad5, and 9.5kb for
Ad35. Chimeric Ad5/9 and Ad5/35 DNA fragments (1.8kb) were used as quantification
standards and applied onto gel together with digested viral/cellular DNA (shown on the
left part of the figures). 
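
The readout described for Fig. 15 can be sketched as a small Python calculation. This is only an illustrative sketch: the band sizes are the ones quoted above, while the function name, the dictionary layout, and the example intensities (85 and 15 arbitrary units) are hypothetical and do not come from the figure itself.

```python
# Diagnostic band sizes (kb) quoted in the Fig. 15 description.
DIAGNOSTIC_BANDS_KB = {
    "Ad5": {"methylated": 35.0, "non_methylated": 6.1},
    "Ad9": {"methylated": 12.0, "non_methylated": 5.8},
    "Ad35": {"methylated": 12.0, "non_methylated": 9.5},
}


def methylated_fraction(methylated_signal: float, non_methylated_signal: float) -> float:
    """Fraction of viral DNA that is still methylated (i.e. not replicated),
    estimated from the two band intensities of one lane."""
    if methylated_signal < 0 or non_methylated_signal < 0:
        raise ValueError("band intensities must be non-negative")
    total = methylated_signal + non_methylated_signal
    if total == 0:
        raise ValueError("no signal in either band")
    return methylated_signal / total


if __name__ == "__main__":
    # Hypothetical intensities for a "load" lane.
    fraction = methylated_fraction(85.0, 15.0)
    print(f"methylated: {fraction:.0%}, replicated: {1 - fraction:.0%}")
```

A lane in which the non-methylated band dominates would, on this reading, indicate that most of the detected viral DNA has replicated in the cells.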

Fig. 16A-16B shows the structure of Ad5GFP and chimeric Ad5GFP/F35 vectors. A)
Schematic diagram of the original E1/E3 deleted Ad5-based vector with GFP-expression
cassette inserted into the E3 region (Ad5GFP) and the chimeric vector Ad5GFP/F35
containing the Ad5/35 fiber gene. The 2.2kb Ad5 fiber gene was replaced by a 0.9kb
chimeric fiber gene encoding for the short shaft and knob of Ad35 by a technique that
involved PCR-cloning and recombination in E. coli. Kpn I (K) and Hind III (H) sites
localized within or around the fiber genes are indicated. The lower panel shows the
detailed structure of the chimeric fiber region. The Ad5 fiber tail [amino acids (aa): 1-44]
was joined in frame to the Ad35 fiber shaft starting from its first two amino acids (GV),
which are conserved among many serotypes. A conserved stretch of amino acids TLWT
marks the boundary between the last β-sheet of Ad35 shaft and the globular knob. The
Ad35 fiber chain termination codon is followed by the Ad5 fiber poly-adenylation signal.
The region of Ad5GFP/F35 encoding for chimeric fiber was completely sequenced with
Ad5 specific primers (see Material and Methods). B) Restriction analysis of viral
genomes. Viral DNA was isolated from purified Ad5GFP and Ad5GFP/F35 particles as
described elsewhere. One microgram of DNA was digested with Hind III or Kpn I and
separated in ethidium bromide stained agarose gels (left panel) which were subsequently
blotted and analyzed by Southern blot with an Ad5 E4 specific probe (nt 32,775-33,651)
(right panel). Specific patterns, designating the correct structure for both viral vectors,
were detected. The Hind III fragments specific for Ad5GFP and Ad5GFP/F35 were 2.9kb
and 4.9kb, respectively. The Kpn I fragment that confirmed the correct Ad5GFP/F35
structure was 1.6kb compared to a 7.6kb Ad5GFP fragment. M - 1kb ladder (Gibco-BRL,
Grand Island, NY).

Fig. 17 shows the generation of ΔAd.AAV genomes by recombination between
inverted homology regions. Recombination between two inverted repeats (IR) present in
one Ad.AAV vector. The first-generation Ad.AAV vector (~34kb) contains two 1.2kb
inverted homology elements flanking the transgene cassette. One AAV-ITR is inserted
between the Ad packaging signal (Ψ) and the left IR. During Ad replication,
recombination between the IRs mediates the formation of the ΔAd.AAV genomes with the
transgene flanked by IRs, AAV ITRs, Ad packaging signals, and Ad ITRs. These
genomes are efficiently packaged into Ad capsids. 
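
The outcome described for Fig. 17 can be illustrated with a deliberately simplified Python model. It does not model the replication mechanism itself. It only assumes, for illustration, that recombination between the two inverted repeats yields a genome in which the left arm of the parental vector is duplicated in mirrored orientation on the right of the transgene cassette. All names (`mirror`, `resolve_between_inverted_repeats`, the element labels and the "Ad backbone" placeholder) are hypothetical.

```python
from typing import List, Tuple

Element = Tuple[str, str]  # (element name, orientation "+" or "-")


def mirror(elements: List[Element]) -> List[Element]:
    """Return the elements in reverse order with flipped orientation."""
    flip = {"+": "-", "-": "+"}
    return [(name, flip[strand]) for name, strand in reversed(elements)]


def resolve_between_inverted_repeats(genome: List[Element],
                                     repeat: str = "IR") -> List[Element]:
    """Simplified model: keep the left arm (up to and including the left IR)
    and the cassette, then append a mirrored copy of the left arm."""
    positions = [i for i, (name, _) in enumerate(genome) if name == repeat]
    if len(positions) != 2:
        raise ValueError("expected exactly two copies of the repeat")
    left, right = positions
    if genome[left][1] == genome[right][1]:
        raise ValueError("the two repeats must be in inverted orientation")
    left_arm = genome[: left + 1]
    cassette = genome[left + 1 : right]
    return left_arm + cassette + mirror(left_arm)


# Parental first-generation Ad.AAV vector, reduced to the elements named in the text.
parent: List[Element] = [
    ("Ad ITR", "+"), ("psi", "+"), ("AAV ITR", "+"), ("IR", "+"),
    ("transgene", "+"), ("IR", "-"), ("Ad backbone", "+"), ("Ad ITR", "-"),
]

resolved = resolve_between_inverted_repeats(parent)
names = [name for name, _ in resolved]
print(" - ".join(names))
```

In this model the resolved genome carries the transgene flanked on both sides by an IR, an AAV ITR, a packaging signal and an Ad ITR, and the remaining adenoviral backbone is lost, consistent with the arrangement stated in the figure description.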

Fig. 18 shows the structure of Ad5/11, Ad5/35. Schematic diagram of the original E1/E3
deleted Ad5-based vector with GFP-expression cassette inserted into the E3 region
(Ad5GFP) and the chimeric vector Ad5GFP/F35 containing the Ad5/35 fiber gene. The
2.2kb Ad5 fiber gene was replaced by a 0.9kb chimeric fiber gene encoding for the short
shaft and knob of Ad35 by a technique that involved PCR-cloning and recombination in
E. coli. Kpn I (K) and Hind III (H) sites localized within or around the fiber genes are
indicated. The lower panel shows the detailed structure of the chimeric fiber region. The
Ad5 fiber tail [amino acids (aa): 1-44] was joined in frame to the Ad35 fiber shaft
starting from its first two amino acids (GV), which are conserved among many serotypes.
A conserved stretch of amino acids TLWT marks the boundary between the last β-sheet
of Ad35 shaft and the globular knob. The Ad35 fiber chain termination codon is
followed by the Ad5 fiber poly-adenylation signal.

Fig. 19 shows the cross-competition for attachment and internalization of labeled
Ad5GFP, Ad35, and chimeric Ad5GFP/F35 virions with unlabeled viruses, and with anti-
CAR or anti-αv-integrin MAb. (A) For attachment studies, 10^5 K562 cells were pre-
incubated with a 100-fold excess of unlabeled competitor virus at 4°C for 1 h. Then,
equal amounts of [3H]-labeled viruses, at a dose equivalent to an MOI of 100 pfu per cell
determined for Ad5GFP, were added to cells followed by incubation at 4°C for 1 h. Cells
were then washed with ice-cold PBS, pelleted and the percentage of attached virus (cell-
associated counts per minute) was determined. For analysis of cross-competition for
internalization, cells were pre-incubated with a 100-fold excess of competitor virus at
37°C for 30 min before labeled virus was added. After an additional incubation at 37°C
for 30 min, cells were treated with trypsin-EDTA for 5 min at 37°C, washed with ice-cold
PBS, pelleted, and the percentage of internalized virus was determined. For controls, cells
were incubated with labeled viruses without any competitors. Preliminary experiments
had shown that the conditions chosen for competition studies allowed for saturation in
attachment/internalization on K562 cells for all unlabeled competitors. (B) 10^5 K562 cells
were pre-incubated for 1 hour at 4°C with anti-CAR MAb (RmcB, diluted 1:100) or with
anti-αv-integrin MAb (L230, diluted 1:30), followed by incubation with labeled viruses
according to the protocols for attachment or for internalization as described above. For
each particular serotype, the percentage of attached/internalized virus was compared to
the control settings, where cells were preincubated under the same conditions with a
1:100 dilution of an irrelevant antibody (anti-BrdU MAb) before addition of the labeled
virus. Note that the specific competitors but not the corresponding controls significantly
inhibited Ad5 internalization to a degree that is in agreement with published data (59).
N >= 4.

Fig. 20. Cross-competition for attachment and internalization of [3H]-labeled Ad5GFP,
Ad35, and chimeric Ad5GFP/F35 virions with unlabeled Ad3 virus (A), and of [3H]-
labeled Ad3 virions with unlabeled viruses (B). 10^5 K562 cells were pre-incubated with
a 100-fold excess of unlabeled viral particles according to attachment or internalization
protocols described for Fig. 6. Equal amounts of [3H]-labeled Ad5GFP, Ad5GFP/F35, or
Ad35 (A) or [3H]-labeled Ad3 (B) were added to cells at a dose equivalent to an MOI of
100 pfu per cell for Ad5GFP. In control settings, cells were incubated with labeled
viruses without any competitors. N=4.

Fig. 21 shows the transduction of CD34+, K562, and HeLa cells with Ad5GFP and
chimeric Ad5GFP/F35 vectors. 1x10^5 cells were infected with different MOIs (pfu/cell)
of viruses in 100 μl of media for 6 hours at 37°C. Virus containing media was then
removed, and the cells were resuspended in fresh media followed by incubation for 18 h
at 37°C. The percentage of GFP expressing cells was determined by flow cytometry. N=3 
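
The doses in these figure descriptions are given as a multiplicity of infection (MOI, pfu per cell). As a minimal sketch of the underlying arithmetic, the Python helpers below convert a cell number and an MOI into total pfu and into a stock volume. The function names and the stock titer of 1x10^10 pfu/ml are hypothetical values chosen only for the example and are not taken from the document.

```python
def pfu_required(cells: int, moi: float) -> float:
    """Total plaque-forming units needed to infect `cells` at a given MOI."""
    if cells < 0 or moi < 0:
        raise ValueError("cells and MOI must be non-negative")
    return cells * moi


def stock_volume_ul(cells: int, moi: float, titer_pfu_per_ml: float) -> float:
    """Volume of virus stock (in microliters) that contains the required pfu."""
    if titer_pfu_per_ml <= 0:
        raise ValueError("titer must be positive")
    return pfu_required(cells, moi) * 1000.0 / titer_pfu_per_ml


if __name__ == "__main__":
    # 1x10^5 cells at an MOI of 200 pfu/cell, hypothetical stock of 1x10^10 pfu/ml.
    print(pfu_required(100_000, 200))
    print(stock_volume_ul(100_000, 200, 1e10))
```

With these example numbers, 2x10^7 pfu are needed, which corresponds to 2 μl of the assumed stock and fits comfortably within the 100 μl infection volume stated above.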

Fig. 22 shows the distribution of GFP-positive cells in subpopulations of human CD34+
cells expressing CAR or αv-integrins. 1x10^5 CD34+ cells were infected with Ad5GFP or
Ad5GFP/F35 at an MOI of 200 pfu/cell as described for Fig. 8. Twenty-four hours after
infection, cells were incubated with anti-CAR (1:100 final dilution) or anti-αv-integrin
(1:30 final dilution) primary MAbs for 1 h at 37°C. Binding of primary antibodies was
developed with anti-mouse IgG-PE labeled secondary MAbs (1:100 final dilution) at 4°C
for 30 min. For each variant, 10^4 cells were analyzed by flow cytometry. The mock
infection variants represent cells incubated with virus dilution buffer only. The quadrant
borders were set based on the background signals obtained with both the GFP- and PE-
matched negative controls. The percentages of stained cells found in each quadrant are
indicated. The data shown were representative for three independent experiments.

Fig. 23A-23B shows the distribution of GFP-positive cells in a subpopulation of human
CD34+ cells, expressing CD34 and CD117 (c-kit). (A) Co-localization of GFP
expression with CD34 or CD117: CD34+ cells were infected with Ad5GFP or
Ad5GFP/F35 at an MOI of 200 pfu per cell under the conditions described for Fig. 8.
Twenty-four hours after infection, cells were incubated with anti-CD34 PE-conjugated
MAbs (final dilution 1:2) or with anti-CD117 PE-conjugated MAbs (final dilution 1:5)
for 30 min on ice, and 10^4 cells per variant were subjected to two-color flow cytometry
analysis. For negative control staining, no antibodies were added to the cells before
analysis. The mock infection variants represent cells incubated with virus dilution buffer
only. The quadrant borders were set based on the background signals obtained with both
the GFP- and PE-matched negative controls. The percentages of stained cells found in
each quadrant are indicated. The experiment was performed two times in triplicates, and
typically obtained results are shown. The SEM was less than 10% of the statistical
average. (B) Transduction of CD34+/CD117+ cells with Ad5GFP and chimeric
Ad5GFP/F35 virus vectors: CD34+ cells, cultured overnight before staining in media
without SCF, were incubated with PE-labeled anti-CD117 MAb for 30 min on ice. The
fraction of CD117-positive cells was sorted by FACS. More than 97% of sorted cells
were positive for CD117. 1x10^5 CD117+/CD34+ cells were infected with Ad5GFP or
Ad5GFP/F35 at an MOI of 200 pfu per cell, as for Fig. 8. Twenty-four hours post
infection, the percentage of GFP positive cells was determined by flow cytometry. For mock
infection, CD117+/CD34+ cells were incubated with virus dilution buffer only. The
infections were done in triplicates, and the average percentage of GFP-expressing cells is
indicated on the corresponding histogram. The SEM was less than 10% of the statistical
average.

Fig. 24 shows the Southern analysis of viral genomes in GFP-positive and GFP-negative
fractions of CD34+ cells infected with the Ad5GFP and chimeric Ad5GFP/F35 vectors.
CD34+ cells were infected with viruses at an MOI of 100 as described for Fig. 21. Twenty-four
hours post infection, cells were sorted by FACS into GFP-positive and GFP-negative
fractions. 10^5 cells from each fraction were used to isolate genomic DNA together with
viral DNA. Before cell lysis, a rigorous treatment with trypsin and DNase followed by
washing was performed to exclude contamination of the genomic DNA samples with
extracellular viral DNA. (A) The upper panel shows the ethidium bromide stained 1%
agarose gel before blotting, demonstrating that similar amounts of genomic DNA were
loaded. This amount corresponded to DNA isolated from ~25,000 GFP+ or GFP- cells.
The lane labeled "load" represents viral DNA purified from Ad5GFP or Ad5GFP/F35
virions mixed with pBluescript plasmid DNA (Stratagene) as a carrier and applied on the
gel at the amount that was actually used to infect 25,000 cells. As a concentration
standard, a serial dilution of Ad5GFP genomes was loaded on the gel (left side). For
Southern analysis (lower panel), an 8 kb-long HindIII fragment corresponding to the E2
region of Ad5 was used as a labeled probe. Hybridized filters were subjected to
PhosphorImager analysis and then exposed to Kodak X-OMAT film for 48 h at -70°C.
The cellular/viral genomic DNA is indicated by an arrow. (B) To detect Ad5GFP
genomes in transduced cells, PCR amplification followed by Southern blot hybridization
was performed on the same samples that were used for quantitative Southern blot
hybridization in (A). DNA purified from ~2,500 cells was subjected to PCR (95°C-1 min,
53°C-1 min, 72°C-1 min, 20 cycles, with primers Ad5-F1 and Ad5-R1). One fifth of the
PCR reaction was subjected to agarose gel electrophoresis (upper panel). A 0.9 kb-long
DNA fragment specific to the E4 region of Ad5 was detected for transduced
Ad5GFP/F35 genomes. DNA was then blotted onto Hybond-N+ membrane, and Southern
blot hybridization (lower panel) with an Ad5 E4-specific DNA probe was performed. In
addition to the 0.9 kb DNA fragment, the PCR primers generated a smaller 0.5 kb-long
fragment that also hybridized with the E4 region probe.

Fig. 25 shows the role of fiber shaft length in Ad infection strategies. CAR-binding
variants (Ad5 and Ad9) and Ad35, which interacts with a non-CAR receptor, were analyzed on
CAR-expressing cells (293, Y79) and on K562 cells, which do not express significant
amounts of CAR. All vectors contain a GFP expression cassette packaged into an Ad5 capsid
with modified fibers.

Fig. 26 shows the tertiary structure of the Ad5 knob: localization of CAR binding sites, H-I
loop and G-H loop.

Fig. 27 shows the substitution of the G-H loop with heterologous peptides. 

Fig. 28 shows the attachment and internalization of metabolically labeled serotypes with
human cell lines.

Figs. 29A-29D show the generation of Rep78-expressing Ad vectors by recombination
between two vectors. (A) The same strategy outlined in Fig. 15 was employed for
vectors with rep78 as a transgene. The Ad5'rep vector also contained the ApoEhAAT
promoter shielded by an HS-4 insulator. The region of homology between the two
fragments of the rep78 gene was 658 nt in length. The Rep78 ORF was deleted for the p5
promoter. The internal Rep 40/52 start codon (at position 993) was mutated to abolish
production of the small Rep proteins. Furthermore, the splice site at nt 1905 was deleted,
eliminating production of Rep68. The individual expression of Rep78 was
demonstrated. (B) Formation of AAd.rep78 genomes. The expected 5.8 kb AAd.rep78
genome was observed only upon coinfection of both Ad5'rep and Ad3'rep into 293 cells,
as demonstrated by Southern blot. (C) Southern blot analysis for rescue of the recombinant
AAV genome from plasmid DNA by Rep78 expressed from pCMVrep78 and AAd.rep78.
The expected rescue product is 3.8 kb (R-plasmid). (D) Southern blot analysis for rescue
of the recombinant AAV genome from Ad.AAV viral vector genomes.

DETAILED DESCRIPTION OF THE INVENTION 

All scientific and technical terms used in this application have meanings commonly used in
the art unless otherwise specified. As used in the present invention, the following words
or phrases have the meanings specified.

The term "vector" includes, but is not limited to, plasmids, cosmids, phagemids, and
artificial chromosomes. The vector sequence may be designated as the viral "base
vector" sequence. The base vector sequence depends on the particular type of
virus and serotype from which it was derived. The base vector
sequence may be linked to non-vector or transgene sequences (e.g., heterologous
sequences).

The transgene sequences may include sequences that confer compatibility with
prokaryote or eukaryote host cells, where compatibility relates to vector replication
within a host cell. Accordingly, the transgene sequence may be a replicon sequence that
directs replication of the vector within the host cell, resulting in an autonomously replicating
vector. Alternatively, the transgene sequence may permit vector replication that is
dependent upon the host cell's replication machinery.

The base vector sequences may be operatively linked to a transgene sequence that
encodes a gene product, such as a polypeptide, rRNA, or tRNA. For example, the
transgene may encode a polypeptide such as a viral capsid protein or a viral fiber protein.
The transgene may be derived from the same serotype as the base vector sequence or
from a different serotype.

Another example of a transgene is a reporter gene that encodes a gene product that
can be used as a selectable marker, such as drug resistance or a colorimetric marker. The
reporter gene may encode a gene product which can be readily detected by, for example,
a visual, microscopic, immunochemical, or enzymatic assay. The preferred reporter gene
encodes a gene product that can be detected by a non-destructive method, i.e., one that
does not destroy the cell that expresses the reporter gene.

A therapeutic gene is another example of a transgene. A therapeutic gene encodes a gene 
product (e.g., polypeptide or RNA) which when expressed in a host cell provides a 
therapeutic benefit or desired function to the host cell or the tissue or the organ or the 
organism containing the host cell. The therapeutic benefit may result from modifying a
function of a gene in the host genome or from the additional function provided by the
therapeutic protein, polypeptide or RNA. 

The base vector sequence may be linked to a transgene sequence that is a regulatory
element, such as a promoter, enhancer, transcription termination signal, or
polyadenylation sequence. The regulatory element may direct expression of the
transgene sequence that encodes a gene product by directing transcription or translation.
The regulatory element may regulate the amount or timing of expression of the transgene
sequence. The regulatory element may direct expression of the transgene in certain host
cells or tissues (e.g., host-specific or tissue-specific expression).



The base vector sequence may be linked to a transgene sequence that permits the vector to
integrate into another nucleotide sequence. The integration sequence may direct
integration of the whole vector or portions of the vector. The integration sequence may
or may not be related to the base vector sequence. For example, the integration and base
vector sequences may be from the same or different viral serotypes. The integration
sequence may be inverted terminal repeat sequences (ITRs) from adenovirus (Ad),
adeno-associated virus (AAV), or HIV.

The base vector sequence may be linked to a transgene sequence that directs homologous 
recombination of the vector into the genome of a host cell. Such transgene sequences 
may or may not be from the same viral serotype as the base vector sequence. 

The vector may be used to transport the heterologous sequence into a host cell or into a 
host cell's genome. 

The vector may comprise multiple endonuclease restriction sites that enable convenient 
insertion of exogenous DNA sequences. 

The term "hybrid vector" as used in the invention refers to a vector which comprises a
nucleic acid sequence combined from two different viruses (e.g., adenovirus and AAV).

"Chimeric vector" refers to a vector which contains nucleic acid sequences that are
unnatural to the base vector (i.e., sequences not occurring naturally or sequences not in
their natural background, including heterologous sequences). A chimeric vector as used
in the invention may also be a hybrid vector. An example of a chimeric vector is
Ad.AAV expressing a modified fiber protein in its capsid.

The term "transduction" or "infection" refers to a method of introducing viral DNA
within a virus particle into a host cell. The viral DNA herein is in the form of
recombinant virus, which is generated by linking a segment of DNA of interest into the
viral genome in such a way that the gene can be expressed as a functional protein.

The term "transfection" refers to a method of introducing a DNA fragment into a host 
cell. 

The term "heterologous" as used herein means that a nucleic acid or peptide sequence is
placed in a context that is not endogenous to the base adenovirus vector or to a
transduced cell. For example, a peptide sequence can be transferred from one protein to
another protein; the resulting protein is referred to herein as a heterologous protein. A
chimeric fiber protein (e.g., a serotype 5 tail domain and a serotype 35 shaft and knob
domain) is considered "heterologous" to the Ad5 vector. The term also includes
nucleic acids (e.g., coding sequences) from one strain or serotype of adenovirus
introduced into a different strain or serotype of adenovirus.

The term "regulatory elements" is intended to include promoters, enhancers, transcription
termination signals, polyadenylation sequences, and other expression control sequences.
Regulatory elements referred to in the invention include, but are not limited to, those
which direct expression of a nucleic acid sequence only in certain host cells
(e.g., tissue-specific regulatory sequences).

The term "operably linked" indicates that a polynucleotide sequence (e.g., a coding
sequence or gene) is linked to a regulatory element in such a way that the regulatory
element sequence controls and regulates the transcription or translation or both of that
polynucleotide sequence. The orientation of the regulatory element may vary (e.g., be in
reverse orientation with respect to the right ITR). The term also includes having an
appropriate start signal (e.g., ATG) in front of the polynucleotide sequence to be
expressed and maintaining the correct reading frame to permit expression of the
polynucleotide sequence under the control of the expression control sequence and
production of the desired polypeptide or protein. Regulatory sequences can also include
3' sequences which ensure correct termination (e.g., a polyadenylation stop signal).



The term "gene therapy" as used herein refers to a method which introduces a segment of
exogenous nucleic acid into a cell in such a way that it results in functional modification
of the recipient cell by expression of the exogenous nucleic acid. The exogenous nucleic
acid is typically therapeutic in that the expression of the encoded protein, polypeptide or
RNA corrects cellular dysfunction due to a genetic error or more generally counteracts
any undesirable functions which are associated with a genetic or acquired disease. The
term "exogenous nucleic acid" refers to DNA or RNA sequences not normally expressed
in the treated transformed cell. The term also refers to DNA and RNA sequences which
are expressed in a treated transformed cell at a higher or lower level, or in an otherwise
different pattern, than in the untreated, nontransformed cell. This non-natural expression
can also be termed heterologous expression.

A "gene therapy vector" refers to a vector used for gene therapy, i.e. to introduce the 
exogenous nucleic acid into a recipient or host cell. The exogenous nucleic acid may be 
transiently expressed or integrated and stably expressed in the recipient or host cell. 

The term "plasmid" as used herein refers to any nucleic acid molecule which replicates 
independently of the host, maintains a high copy number, and which can be used as a 
cloning tool. 

The terms "parallel strand of DNA" and "anti-parallel strand of DNA" refer to each of
the strands of DNA of the double-stranded adenovirus genome. The Figures diagram the
location of certain nucleotides on the parallel strand of DNA. The anti-parallel strand of DNA
refers to the other of the two strands of DNA, which is not depicted in the Figures. The
fiber protein is encoded on the anti-parallel strand of DNA. To simplify the vector
diagrams, the fiber sequences are shown on the parallel strand even though the gene is
located on the anti-parallel strand.

The term "reporter gene" refers to any nucleic acid sequence which encodes a
polypeptide or protein which can be readily detected by, for example, a visual,
microscopic, immunochemical or enzymatic assay. Preferred reporter genes are those
that can be detected by a non-destructive method that does not destroy the treated,
transformed cells or tissue.

The term "selection gene" used herein refers to any nucleic acid fragment which encodes
a polypeptide or protein whose expression is used to mark a cell as having been
transformed by a given vector.

The term "therapeutic gene" refers herein to a DNA fragment encoding a functional 
polypeptide, protein or RNA, which when expressed in a host cell provides a therapeutic 
benefit or desired function to the host cell or to the organ or organism containing the host
cell. The therapeutic benefit may result from modification of a function of a native gene 
in a host or from the additional function provided by the therapeutic protein, polypeptide 
or RNA. 

The term "host tissue" or "host cell" as used herein, refers to a tissue or cell in which a 
therapeutic gene is to be expressed to modify its function. 

It is well-known in the biological arts that certain amino acid substitutions may be made
in protein sequences without affecting the function of the protein. Generally,
conservative amino acid substitutions or substitutions of similar amino acids are tolerated
without affecting protein function. Similar amino acids can be those that are similar in
size and/or charge properties; for example, aspartate and glutamate, and isoleucine and
valine, are both pairs of similar amino acids. Similarity between amino acid pairs has
been assessed in the art in a number of ways. For example, Dayhoff et al. (1978) in Atlas
of Protein Sequence and Structure, Volume 5, Supplement 3, Chapter 22, pp. 345-352,
which is incorporated by reference herein, provides frequency tables for amino acid
substitutions which can be employed as a measure of amino acid similarity. Dayhoff et
al.'s frequency tables are based on comparisons of amino acid sequences for proteins
having the same function from a variety of evolutionarily different sources. Therefore,
any obvious changes in the amino acid sequences (as described above) to the sequences
of the invention are already contemplated.

Polypeptides which are "substantially similar" share sequences as noted above except that
residue positions which are not identical may differ by conservative amino acid changes.
Conservative amino acid substitutions refer to the interchangeability of residues having
similar side chains. For example, a group of amino acids having aliphatic side chains is
glycine, alanine, valine, leucine, and isoleucine; a group of
amino acids having aliphatic-hydroxyl side chains is serine and threonine; a group of
amino acids having amide-containing side chains is asparagine and glutamine; a group of
amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a
group of amino acids having basic side chains is lysine, arginine, and histidine; a group
of amino acids having sulfur-containing side chains is cysteine and methionine.
Preferred conservative amino acid substitution groups include but are not limited to:
valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine,
asparagine-glutamine, and aspartate-glutamate. Therefore, polypeptide substitutions for
"substantially similar" sequences (as described above) to the amino acid sequences
described in the invention are already contemplated.
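
As an illustration only, the preferred conservative substitution groups listed above can be encoded as a lookup to test whether two aligned peptide sequences differ solely by conservative changes. The function names, the one-letter amino acid encoding, and the requirement that the sequences be pre-aligned and of equal length are assumptions made for this sketch and are not part of the described invention.

```python
# Preferred conservative substitution groups named in the text,
# written in one-letter amino acid code (an encoding chosen for this sketch).
PREFERRED_GROUPS = [
    set("VLI"),  # valine-leucine-isoleucine
    set("FY"),   # phenylalanine-tyrosine
    set("KR"),   # lysine-arginine
    set("AV"),   # alanine-valine
    set("NQ"),   # asparagine-glutamine
    set("DE"),   # aspartate-glutamate
]


def is_conservative(a: str, b: str) -> bool:
    """Return True if residues a and b are identical or share a preferred group."""
    if a == b:
        return True
    return any(a in group and b in group for group in PREFERRED_GROUPS)


def substantially_similar(seq1: str, seq2: str) -> bool:
    """Return True if two pre-aligned, equal-length sequences differ only by
    substitutions within the preferred groups (hypothetical helper)."""
    if len(seq1) != len(seq2):
        return False
    return all(is_conservative(a, b) for a, b in zip(seq1.upper(), seq2.upper()))
```

Under these assumptions, a lysine-to-arginine or glutamate-to-aspartate change passes the check, whereas alanine-to-leucine does not, because those two residues do not share any one of the listed groups.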

In order that the invention herein described may be more fully understood, the following 
description is set forth. 

The present invention provides unique gene transfer vehicles which overcome many of
the limitations of prior art vectors. The invention describes first generation adenovirus
vectors comprising left and right Ad ITRs, an Ad packaging sequence, a transgene
cassette with regulatory elements, and a pair of cassette ITRs flanking the transgene
cassette that direct predictable viral genomic rearrangements during viral replication as
well as direct the integration of the transgene cassette into the host cell genome.

One predictable rearrangement that occurs during viral replication is the generation of a
gutless adenovirus vector (also referred to herein as AAd) that comprises right and left Ad
ITRs, an Ad packaging sequence, and a transgene cassette flanked by cassette ITRs; the
gutless vector is devoid of all other immunogenic viral genes.

The potential for site-specific integration is an important characteristic of the novel Ad
vectors of the invention. In an embodiment of the invention, integration of the transgene
cassette is directed by co-infection with an Ad vector expressing, e.g., the Rep78 protein
to achieve site-specific integration in, e.g., the AAVS1 site on human chromosome 19.

The invention further describes a novel way of targeting these recombinant adenovirus
vectors to selected cells by modifying the adenovirus fiber protein that is expressed on
the capsid. Changes to both the fiber shaft and the fiber knob domain proved to
successfully retarget the Ad vector to a desired cell type. In addition, the G-H loop within
the fiber knob domain is identified as a novel site that affects the binding affinity and
specificity of the recombinant adenovirus vector. Substitution of peptide sequences into
the G-H loop retargets the gutless vector to a desired cell type.

An adenovirus display library has been generated that expresses random peptides within
the G-H loop of the fiber protein. This type of library provides ligands with which to screen for
adenovirus vectors that bind to desired cell types. One advantage of using an adenovirus
display library versus a phage display library is that once adenovirus affinity to a desired
cell is identified, the targeted adenovirus vector is ready to accept a transgene cassette and
can be used to generate a gutless adenovirus vector, for example for use in gene therapy.

The chimeric Ad vectors described below contain a modified fiber protein in the capsid
of the adenovirus which renders the vector capable of infecting a desired cell type.
Therefore, according to the invention, a gutless chimeric AAd.AAV vector can be
generated to introduce any transgene(s) into any host cell or tissue which is normally
refractory to most commonly used gene therapy viral vectors. In addition, the chimeric
AAd.AAV vector of the invention is devoid of adenoviral genes and contains AAV ITR
sequences that flank the transgene cassette, which direct stable transgene integration in
the host genome, allowing long-term expression of the transgene.



The transgene cassette described in the invention may carry a transgene which is either a
reporter gene, a selectable gene for in vitro or in vivo selection of transduced cells, or a
therapeutic gene. In one embodiment of the invention the reporter transgene can be, but is
not limited to, β-galactosidase. Many reporter genes are commonly used in the art, any of
which could be carried as a transgene in the Ad.AAV vector of the invention. Other
examples of reporter genes are GFP and alkaline phosphatase.

The following describes (a) an embodiment of the first generation Ad vectors of the
invention having a wild-type capsid and a transgene cassette flanked by cassette ITR
sequences; (b) a fiber protein that is modified to retarget Ad vectors; and (c) the
combination of both technologies that enables the production of a chimeric AAd vector
including a modified fiber protein expressed on the capsid which retargets the base vector
to a desired cell type for infection and transgene integration.

A. Integrating Ad hybrid vectors of the invention: 

It has been shown that inverted repeats (IRs) inserted into the E1 region of AdE1- vectors
can mediate predictable genomic rearrangements resulting in a gutless vector genome
devoid of all viral genes. A specific embodiment of such IR-mediated rearrangements is
the Adeno-AAV, a first generation hybrid adenovirus vector containing AAV inverted
terminal repeats (ITRs) flanking a transgene cassette. The AAV ITRs mediate the
formation of a genome similar to that of the AAd.IR genome (Steinwaerder et al., 2000
Journal of Virology). AAd vectors devoid of all viral genes stably integrate and transduce
cultured cells with efficiencies comparable to, e.g., rAAV vectors. The Examples
demonstrate by Southern blot analysis that the AAd vectors integrate randomly into the
host genome.

The Ad vectors of the invention comprise a left Ad ITR; an adenovirus packaging
sequence located 3' to the Ad ITR; and a transgene cassette located 3' to the packaging
sequence, comprising a polyadenylation signal, a transgene, and a heterologous promoter,
and flanked by a pair of cassette ITRs. Adenoviral genes used for replication such as E1,
E2, E3, E4 are located 3' to the right cassette ITR and a right Ad ITR is located 3' to the
replication genes. The vectors of the invention are particularly suited to treat genetic
disorders, cancers, and infectious diseases (such as HIV, emboli, or malaria). Genetic
diseases such as hemophilia A and B, cystic fibrosis, muscular dystrophy, and
α1-antitrypsin disorder are ideal candidates for treatment with
vectors of the invention. A specific example of a therapeutic gene to combat a genetic
disorder is gamma-globin to ameliorate sickle cell anemia.

To aid in the selection of transduced cells and to characterize the integration site of the
transgene cassette, an embodiment of the invention includes the addition of a sequence
comprising a bacterial origin of replication plus a selectable gene. An
embodiment of this is an SNori sequence added to the transgene cassette. This allows the
AAd to be expressed in human and bacterial cells, therefore allowing selection of the
transduced cells and characterization of the integration site in the genome of transduced
mammalian cells.

The potential for site-specific integration is an important characteristic of the novel Ad
vectors of the invention. In an embodiment of the invention, integration of the AAd.AAV
is directed by co-infection with Ad.AAV expressing the Rep78 protein in 293 cells to
achieve site-specific integration in the AAVS1 site on human chromosome 19. For this
type of site-specific integration to occur in cells other than 293 cells, E4 ORF6
expression is required. The co-infection of AAd.AAV, AAd.rep78, and AAd.E4-orf6
allows for site-specific integration of the AAd.AAV transgene cassette. The AAd.rep78
and the AAd.E4-orf6 genomes are degraded soon after transduction, thus avoiding
potential side effects. Site-specific integration is preferred over random integration,
which is seen with rAAV and AAd.AAV, in order to reduce the risk of insertional
mutagenesis.

Integration of the transgene cassette contained in the adenoviral vectors into
chromosomes may be associated with silencing (or blocking) of transgene expression.
The silencing of transgenes can be overcome by adding insulator elements to the
transgene cassette. For example, HS-4 insulator elements derived from the chicken
β-globin LCR can function in Ad vectors to shield heterologous promoters from adenoviral
enhancers. HS-4 insulators or the Drosophila Gypsy gene can also be used to prevent
silencing of transgenes.

Another embodiment of the invention is to split the transgene cassette into two portions
of the transgene, each carried in a different recombinant adenoviral vector of the
invention. Each portion of the same transgene has an overlapping region of homology.
After infection with both vectors, each carrying a different but overlapping portion of
the same transgene, a homologous recombination event occurs, resulting in the
reconstitution of the complete transgene, which is then expressed. This technique is used
to produce hybrid adenoviral vectors that accommodate large inserts including, but not
limited to, a 13kb genomic hAAT gene or a 12kb γ-globin LCR γ-globin expression
cassette for ameliorating sickle cell anemia (or correcting γ-globin mutations). The
formation of the hybrid AAd vector genomes, after recombination between two vectors, is
more efficient if the overlapping region of homology within the transgene is longer.
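
The bookkeeping behind the two-vector strategy can be pictured with a toy string model: two fragments of one transgene share a region of homology, and joining them across that region restores the full sequence. This is only a sketch under stated assumptions. The function name, the `min_overlap` threshold, and the short example sequence are hypothetical, and the code does not model the cellular recombination process itself.

```python
def reconstitute(five_prime: str, three_prime: str, min_overlap: int = 4) -> str:
    """Join two fragments across their longest shared overlap.

    The overlap is the longest suffix of `five_prime` that is also a prefix
    of `three_prime`. Raises ValueError if it is shorter than `min_overlap`.
    """
    max_len = min(len(five_prime), len(three_prime))
    for size in range(max_len, min_overlap - 1, -1):
        if five_prime[-size:] == three_prime[:size]:
            # Keep the shared region once, as after a single crossover.
            return five_prime + three_prime[size:]
    raise ValueError("no region of homology of the required length")


# Toy example: a 12-letter "transgene" split with a 4-letter overlap.
transgene = "ATGGCCTTAGGA"
left_part = transgene[:8]    # carried by the first vector
right_part = transgene[4:]   # carried by the second vector
rebuilt = reconstitute(left_part, right_part)
```

In this model the rebuilt sequence equals the original transgene, and fragments without a sufficiently long shared region cannot be joined, which loosely mirrors the observation that a longer region of homology favors formation of the recombined genome.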

An advantage of the present invention is a method to rapidly isolate pure gutless hybrid
adenoviral vectors such as AAd.AAV or AAd.AAV fx vectors. To minimize the
contamination of AAd with first generation vectors (Ad vectors), a strategy is described in
Example I H. It is anticipated that these approaches will yield the same titer of AAd
vectors; however, the contamination with full-length genome vectors will be lower. This
improved isolation of the vectors is extremely important to avoid toxic side effects after
in vivo application.

B. Tropism modified adenovirus vectors: 

The Ad vectors of the invention can be modified so that they target a host cell of interest.
There are more than 50 human Ad serotypes (Appendix I), including variants with
different tissue selectivity or tropism. It is accepted in the art that different Ad serotypes
bind to different cellular receptors and use different entry mechanisms. Most recombinant
adenovirus vectors use adenovirus serotype 5 (Ad5) as the base vector (Hitt,
M.M., et al., 1997, Adv. in Pharmacology 40, 137-205). Ad5 infection is primarily
mediated by its fiber protein binding to CAR and secondarily by its penton base protein
binding to integrin. Due to the lack of CAR and/or integrin expression on many cell and
tissue types, Ad5-mediated gene transfer is inefficient in a number of tissues which are
important targets for gene therapy, such as endothelia, smooth muscle, skin epithelia,
differentiated airway epithelia, brain tissue, peripheral blood cells, or bone marrow. The
following describes Ad5 vectors of the invention having a change in infectivity and
tropism as a result of altering the fiber protein sequence.

The infectivity of different Ad serotypes is limited to a number of human cell lines.
Infectivity studies revealed that Ad5 and Ad3 are particularly suitable for infecting and
targeting endothelial or lymphoid cells, whereas Ad9, Ad11 and Ad35 efficiently infected
human bone marrow cells. Therefore, the knob domains of the fiber proteins of Ad9, Ad11
and Ad35 are excellent candidates for retargeting the Ad5 vector to human bone marrow
cells. Other possible serotypes include Ad7.

In the modified fiber protein of the invention the fiber knob domain of the Ad5 fiber has
been replaced with another Ad serotype fiber knob domain. An embodiment of the
invention is the modified Ad5/35 fiber protein (a recombinant Ad5 vector expressing a
modified fiber protein comprising the fiber tail domain of Ad5 and the fiber shaft and
knob domains of Ad35). The Ad5/35 chimeric fiber protein shows a broader spectrum of
infection to a subset of CD34+ cells, including those with stem cell activity. The Ad5/11
chimeric fiber protein (a recombinant Ad5 vector expressing a modified fiber protein
comprising the fiber tail domain of Ad5 and the fiber shaft and knob domains of Ad11)
showed similar tropism.

In addition to the knob domain modifications, the invention describes the added
advantage of modifying both the fiber shaft domain and the fiber knob domain to produce
a shortened fiber protein. The length of the fiber shaft domain plays a key role in the host
receptors used for viral vector entry into the host cell. To show this, Ad5, Ad5/9, and
Ad5/35 variants were constructed with long-shafted (22 β-sheets) and short-shafted
(7 β-sheets) fibers. These analyses demonstrated that efficient viral infection involving CAR
as the primary receptor for Ad5 and Ad5/9 requires a long-shafted fiber protein, whereas the
cell entry strategy of Ad5/35 (which binds to a still uncharacterized non-CAR receptor)
does not depend on the shaft length (Fig. 3). The modification of both the fiber shaft
domain length (between 5 and 10 β-sheets) and the fiber knob domain (from a different
Ad serotype than the base vector) is a novel mode of altering Ad vector tropism.

To broaden the repertoire of cell types that Ad vectors can infect, a specific binding 
region, the G-H loop, within the knob domain has been newly identified herein to 
improve binding affinity and specificity. Alteration within this region will redirect the Ad 
vector to a desired cell type. For example, the invention describes the G-H loop sequence 
within the fiber protein knob domain, which can be replaced with heterologous peptide 
ligand sequences without affecting the functionally important tertiary structure of the Ad 
fiber knob domain, while changing the binding affinity and specificty of the vector (Figs. 
6,7). This G-H loop region is exposed on the central part of the knob surface and may be 
strategically a better site for incorporation of heterologous ligands than the peripheral H-I 
loop (Krasnykh, V. et al., 1998 . J. Virol, 72:1844-52.) of the knob C-terrninus (Michael, 
S. I., et al., 1995, Gene Ther. t 2:660-8., Wickham, T. J. et al.. 1996 , Nat. BiotechnoL. 
14:1570-3.), which are the substitution sites used by others. Therefore, these G-H loop 
modifications within the fiber knob domain will allow the Ad vector to be redirected to 
infect a desired cell type, as long as the G-H loop ligand sequence binds to at least one 
surface protein on the desired cell type. Fig. 7 shows some possible substitutions. 
Example II J demonstrates that the virion tolerates the insertion of a cyclic peptide (12
amino acids) with a constrained secondary structure that allows its exposure on the knob
surface. A defined ligand (RGD) can be inserted into the G-H and the H-I loop of an
Ad5 capsid that is ablated for CAR and integrin tropism. Infectivity studies show the
potential advantage of this new insertion site.



Use of the Vectors of the Invention for "Gene Therapy" 

The liver is the major organ for protein synthesis. Therefore, an important goal of gene
therapy is to target gene therapy vectors to the liver. To genetically correct many types of
mutant proteins, hepatocytes need to be infected with gene therapy vectors carrying a
corrected transgene. Example II J describes a G-H loop substitution in the knob domain
of the fiber protein with both RI and RII+ (of the malaria circumsporozoite surface
protein) in a short-shafted fiber protein, which directs the vector to have affinity and
specificity for hepatocytes.

Example II K applies a similar protocol to alter the fiber knob domain in the G-H loop
region with peptides that target the vector to breast cancer cell lines (MDA-MB-435). These
novel approaches to redirecting vectors described in the invention allow lower doses of the
gene therapy vectors to be administered with a higher safety profile.

In Example II L, a protocol for preparing an adenovirus display library is described that
uses the fiber knob protein to display a library of random peptide sequences within the
G-H loop. This library of adenoviruses with modified fiber proteins is screened for affinity
and specificity for a desired cell type. There are two main advantages of using this 
adenovirus display library to screen for target peptides that allow binding to a desired cell 
type over a phage display library system. First, once a ligand peptide is identified that 
binds to the desired cell type it is already in the vector of choice for gene therapy 
delivery. The peptide does not need to be engineered into another vector, as is the case 
for the phage display library vectors. This reduces the steps required to identify a targeted 
fiber protein for a desired cell type. The second advantage of this method is that the
adenoviruses are able to display multiple copies of the modified fiber protein on their
capsid. This allows for dimerization and trimerization of the fiber protein with the host
cell receptor. The multimerization of fiber proteins is a realistic, in vivo interaction of
the trimeric fiber protein with the host cell receptor. In contrast, phage vectors can only
display one fiber peptide sequence on their surface, which significantly limits their ability
to interact with host cell surface receptors.
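
As a rough illustration of the scale of a random peptide display library, the following Python sketch counts the theoretical number of distinct peptides of a given length and draws a few example sequences in silico. The peptide length of 12 residues is borrowed from the insert size mentioned for Example II J and is only an assumption here. The function names are hypothetical, and the sketch says nothing about how the library is physically constructed or screened.

```python
import random

# One-letter codes of the 20 standard amino acids.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def library_diversity(peptide_length: int) -> int:
    """Theoretical number of distinct peptides of the given length."""
    return len(AMINO_ACIDS) ** peptide_length


def sample_peptides(n: int, peptide_length: int, seed: int = 0) -> list:
    """Draw n random peptide sequences (illustrative only, hypothetical helper)."""
    rng = random.Random(seed)
    return [
        "".join(rng.choice(AMINO_ACIDS) for _ in range(peptide_length))
        for _ in range(n)
    ]


if __name__ == "__main__":
    length = 12  # assumed insert length, see lead-in
    print("theoretical diversity:", library_diversity(length))
    for peptide in sample_peptides(3, length):
        print(peptide)
```

The theoretical diversity grows as 20 to the power of the insert length, so any physical library can only sample a fraction of the possible sequences for longer inserts.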

C. A chimeric adenovirus vector with selective tropism:

The chimeric vectors of the invention combine two vectors: an Ad.ITR and an Ad.fx,
where fx describes a modified fiber protein. A first generation adenovirus vector of
serotype 5 is the base vector that carries a transgene cassette flanked by heterologous
ITRs. These specific inverted terminal repeat sequences, such as AAV ITRs, direct stable
integration of the transgene cassette into the host genome as well as control predictable
genomic rearrangements that occur during viral replication. This vector can also carry a
modified fiber gene (described in Examples II). During replication, predictable genomic
rearrangements occur which result in the generation of a gutless adenovirus vector (e.g.
ΔAd.AAV*) which expresses the modified fiber protein on its capsid. The modified fiber
protein allows the gutless vector to be targeted to a selected cell type. The targeted
vector is a gutless adenovirus vector devoid of adenoviral genes which can integrate its
transgene into the host genome. The transgene cassette can carry reporter, selectable, or
therapeutic genes.



In one embodiment of the invention, the gutless targeted ΔAd.AAV* carries the
β-galactosidase reporter gene (ΔAd.AAV*-BG). For easy in vitro selection of human and
bacterial cells that are transduced with the hybrid Ad vector, a bacterial sequence for the
origin of replication can be added to the hybrid Ad vectors. An example of this is
ΔAd.AAV*-SNori, in which an SNori sequence is added into the transgene cassette. This
site allows for G418 selection of cells infected with ΔAd.AAV*-SNori. This in vitro
selection provides a tool to analyze the site of transgene integration and the flanking
chromosomal regions. Fluorescent in situ hybridization (FISH) is an alternative method
to confirm vector integration.

An advantage of the ΔAd chimeric vector for gene transfer is the efficient and stable
integration of a large transgene cassette of up to, e.g., 22 kb, which is significantly larger than
the capacity of retroviral vectors. This is of particular interest for gene therapy. For
example, to ameliorate sickle cell anemia, ΔAd.AAV*-γ-globin, an expression transgene
cassette with the gamma-globin gene that targets and integrates, can be inserted into bone
marrow stem cells for long term expression of the gamma-globin gene.

To achieve site-specific gene integration, rep78 protein is used for transgene integration
into the AAVS1 site (described in Example I D). However, this may silence transgene
expression. To prevent the integrated transgene from being silenced by host genomic
elements (such as positional effects or downstream enhancers), LCRs or insulator
elements are incorporated into the transgene cassette.

The following examples are presented to illustrate the present invention and to assist one of 
ordinary skill in making and using the same. The examples are not intended in any way to 
otherwise limit the scope of the invention. 

EXAMPLE I 

NOVEL ADENOVIRAL VECTOR Ad.AAV 

A. Integrating ΔAd.AAV hybrid vectors devoid of all adenoviral genes.

In vitro and in vivo studies with rAAV indicate that the only requirements for rAAV
integration are the AAV ITRs and as yet unknown host cellular factors. It is thought that
specific sequences or secondary structures present in AAV ITRs are prone to integration
into host chromosomal DNA. In order to combine the advantages of adenoviral vectors (high
titer, high infectivity, large capacity) and the integration capability of AAV ITRs, AAV
vector DNA with AAV ITRs flanking a secreted human placental alkaline
phosphatase (SEAP)-neomycin phosphotransferase (neo) reporter gene cassette
(Alexander, I.E., et al. 1996, Human Gene Therapy, 7:841-50) is incorporated into the E1
region of E1/E3 deleted adenoviral vectors (Ad.AAV1) (Figure 1, top).

METHODS 



Production/Characterization of Viral Vectors 
Plasmids : 

The AAV1 vector cassette containing AAV ITRs and SEAP/neo expression units is
obtained by AseI/ScaI digestion of the plasmid pALSAPSN (Alexander, I. E. et al., 1996,
Human Gene Therapy, 7:841-50). The 4.4 kb AAV vector fragment was cloned via NotI
adapter linkers into pXCJL1 (Microbix, Toronto, Canada) (pAd.AAV1). Another shuttle
vector (pAd.AAV1-Δ2ITRs) lacking the AAV ITRs is generated by inserting the 3.7 kb
AflII/BsmI fragment of pALSAPSN into pXCJL1. For pAd.AAV1-Δ1ITR, a construct is
used where a spontaneous deletion in the left AAV ITR between the A and A' regions
has occurred. To create a second hybrid vector (Ad.AAV2), the AAVSNori cassette
developed by E. Rutledge is used. AAV vector DNA is obtained from pASNori
(Rutledge, E. A., Russell, D.W. 1997. Journal of Virology 71:8429-8436) as a 3.4 kb
BsaI/ScaI fragment and inserted into the EcoRV site of pXCJL1. As is generally
known for AAV vector plasmids, the AAV ITRs are prone to rearrangements. To
minimize deletions in these functionally critical regions, all constructs for generation of
hybrid vectors are assembled in low copy-number plasmids which are grown in E. coli
Top10, JC811, or XL1-Blue cells (Stratagene, La Jolla, Calif.). Furthermore, after each
cloning step or large-scale plasmid amplification, both AAV ITRs are carefully mapped
by restriction analysis with enzymes that cut inside or adjacent to the ITRs (BssHII,
AhdI, SmaI, BglI, BsmI, AflII, and ScaI).

First-generation viruses with the different transgene cassettes incorporated into the E1
region are generated by recombination of the pΔE1aSp1a- or pXCJL1-derived shuttle
plasmids and pJM17 (Microbix) in 293 cells as described earlier (Lieber, A., et al., 1996,
J. of Virology, 70, 8782-8791). For each virus, at least 20 plaques are picked, amplified,
and analyzed by restriction digest. Viruses containing two AAV ITRs tend to rearrange
within the ITRs, with other adenoviral sequences, or with adenoviral sequences present in
the 293 cell genome. Only plaques from viruses with intact ITRs are amplified, CsCl
banded, and titered as described earlier (Kay, M. A., et al. 1995. Hepatology 21:815-819;
Lieber, A., et al. 1996. Journal of Virology 70:8944-8960). All virus preparations tested
are negative for RCA and bacterial endotoxin (Lieber, A., et al. 1997. Journal of Virology
71:8798-8807). Virus is stored at -80°C in 10 mM Tris-Cl, pH 7.5, 1 mM MgCl2, 10%
glycerol.

To generate ΔAd.AAV, 293 cells are infected with Ad.AAV1 at a multiplicity of
infection (MOI) of 25 and harvested 40 h after infection. Cells are lysed in PBS by 4
cycles of freeze/thawing. Lysates are centrifuged to remove cell debris and digested for
30 min at 37°C with 500 units/ml DNase I and 200 µg/ml RNase A in the presence of
10 mM MgCl2. 5 ml of lysate is layered on a CsCl step gradient (0.5 ml of 1.5 g/cm³, 2.5 ml
of 1.35 g/cm³, 4 ml of 1.25 g/cm³) and ultracentrifuged for 2 h at 35,000 rpm (rotor SW41).
CsCl fractions are collected by puncturing the tube and are analyzed for viral DNA
(Lieber, A., et al. 1996. Journal of Virology 70:8944-8960; Steinwaerder, D. S., et al.
1999. J Virol 73:9303-13) or subjected to ultracentrifugation at 35,000 rpm for 18 hours
in an equilibrium gradient with 1.32 g/cm³ CsCl. The band containing the deleted viruses
ΔAd.AAV is clearly separated (0.5 cm distance) from other banded viral particles
containing full-length Ad.AAV genomes. ΔAd.AAV1 fractions are dialyzed against
10 mM Tris-Cl, pH 7.5, 1 mM MgCl2, 10% glycerol and stored at -80°C. The genome titer
of ΔAd.AAV1 preparations is determined based on quantitative Southern analysis of viral
DNA purified from viral particles in comparison to different concentrations of a 4.4 kb
AseI/ScaI fragment of pALSAPSN according to a protocol described earlier (Lieber, A.,
et al., 1996, J. of Virology, 70, 8782-8791). In total, the production of 1x10^13 genome-
particles of ΔAd.AAV1 requires less than 3 hours of actual work.

Titers routinely obtained are in the range of 3-8x10^12 genomes per ml. Assuming one
genome is packaged per capsid, the genome titer equals the particle titer. The level of
contaminating Ad.AAV1 is less than 0.1% as determined by Southern analysis, which is
consistent with results obtained by plaque assay on 293 cells (fewer than 5 plaques per
10^6 total genomes). The primers used for sequencing the left and right ITR-vector
junctions are

5'GGCGTTACTTAAGCTAGAGCTTATCTG, and
5'CTCTCTAGTTCTAGCCTCGATCTCAC
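
The basic properties of the two sequencing primers listed above can be checked with a short Python sketch. The labels primer_1 and primer_2 are hypothetical (the text does not name the primers or state which one reads the left or the right junction), and GC content is computed as a simple fraction without any melting temperature model.

```python
# Primer sequences as given in the text, written 5' to 3'.
PRIMERS = {
    "primer_1": "GGCGTTACTTAAGCTAGAGCTTATCTG",  # hypothetical label
    "primer_2": "CTCTCTAGTTCTAGCCTCGATCTCAC",   # hypothetical label
}


def gc_count(sequence: str) -> int:
    """Number of G and C bases in a DNA sequence."""
    return sum(1 for base in sequence.upper() if base in "GC")


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    return gc_count(sequence) / len(sequence)


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(f"{name}: length={len(seq)} nt, GC={gc_fraction(seq):.1%}")
```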

The recombinant AAV virus stock containing the SEAP/neo cassette (AV2/ALSAPSN;
Alexander, I. E. et al., 1996, Human Gene Therapy, 7:841-50) used in these studies was
obtained from Dusty Miller (FHCRC, Seattle). The stock was free of contaminating
replication competent AAV (<50 particles/ml) and wildtype adenovirus (<100
particles/ml). The genome titer of the virus stock was obtained by quantitative Southern
blots as described by Russell et al. (Russell, D. et al. 1994 Proc. Natl. Acad. Sci. USA
91:8915-8919).

Electron Microscopy: 

For examination of viral particles in the transmission electron microscopy studies,
CsCl-purified virions are fixed with glutaraldehyde and stained with uranyl acetate as described
previously (Lieber, A., et al. 1996. Journal of Virology 70:8944-8960).

RESULTS 

During replication of these hybrid vectors in 293 cells, a 5.5 kb genome (ΔAd.AAV1) is
efficiently generated and packaged into adenovirus (Ad5) capsids. The ΔAd.AAV1
genome contains the left adenovirus ITR and the packaging signal followed by the AAV
vector cassette and a duplicate of the adenoviral packaging signal and ITR in reverse
orientation (Figure 1, bottom). The hybrid vector is devoid of all viral genes, thus
eliminating toxic effects and the elicitation of cellular immune responses. The
spontaneous formation of the small hybrid vector genome ΔAd.AAV1 requires the
presence of two intact AAV ITRs and does not occur with partly deleted ITRs or oligo-dC
and oligo-dG stretches flanking the expression cassette.

Hybrid vectors containing different transgenes:

To construct a hybrid vector with a transgene that can be detected in situ, the SEAP/neo
expression unit in Ad.AAV1 is replaced by the E. coli β-galactosidase gene. This hybrid
vector is named ΔAd.AAVBG. During generation of the corresponding plasmid constructs,
the AAV ITR sequences tend to rearrange and lose their functional properties. This
problem can be circumvented by using low copy number plasmids as cloning vectors
grown in bacterial strains depleted for all recombination proteins (e.g., JC811).
Furthermore, the intactness of both AAV ITRs after each cloning step can be examined
by characteristic endonuclease digestion. Recently, another hybrid vector,
ΔAd.AAVSNori, has been generated which contains the neo gene under the control of
both the simian virus 40 (SV40) early promoter and the transposon 5 (Tn5) promoter for
expression in human and bacterial cells, as well as the p15A bacterial replication origin
with the direction of the leading strand DNA synthesis opposite that of neo gene
transcription. Thus, SNori can be used for G418 selection of integrated vector in
eukaryotic cells as well as for rescue of vector together with flanking host DNA after
integration. The recovered plasmids can be propagated in E. coli under selection with
kanamycin due to the bacterial origin and the neo gene. SNori-containing vectors allow a
rapid estimation of total integration events based on the number of G418 resistant
colonies. Moreover, vector DNA together with flanking chromosomal DNA can be
rescued as plasmids from single G418 resistant clones and can be used for sequencing to
determine integration junctions. Both hybrid vectors are produced at a titer of about
3x10 genomes per ml. The ratio of genome titer to transducing particles for
ΔAd.AAVBG is ~200:1 based on β-Gal expression.

DISCUSSION 



ΔAd.AAV1 could spontaneously form during adenovirus replication. Another possible
mechanism of ΔAd.AAV1 formation is based on the unique mechanism by which
adenovirus replicates its genome (van der Vliet, B., 1995, In W. Doerfler, et al. (eds.) vol.
2 p. 1-31, Springer-Verlag, Berlin) (see Figure 1). Ad DNA replication is initiated by the
TP/pTP (terminal protein) that binds to specific sites within the ITRs on both ends of the
linear genome and catalyzes, in complex with Ad pol, the binding of the 5' CTP, the first
nucleotide of the daughter strand. DNA synthesis proceeds in a continuous fashion to the
other end of the genome (Figure 1A). Only one of the DNA strands serves as template.
One of the replication products is a single-stranded DNA that circularizes through
annealing of its self-complementary ITRs. The resulting duplex "panhandle" has the
same structure as the termini of the duplex viral genome, which allows the binding of pTP
and the initiation of synthesis of a complementary strand using the single-stranded
"panhandle" molecule as template (Figure 1C). In the case of Ad.AAV1, the Ad pol
synthesizes the single strand of the adenoviral genome starting from the left Ad ITR until
it reaches the second AAV ITR. During synthesis of the second AAV ITR, a certain
percentage of the single-stranded molecules form a loop hybridizing to the
complementary region within the first AAV ITR that was replicated earlier, allowing Ad
pol to use the same viral DNA strand to read back towards the left ITR (Figure 1B). The
resulting "panhandle" structure can be resolved in a similar way as the full-length
intermediate shown in Figure 1C, generating a double stranded, linear molecule with the
above described structure that can be packaged into Ad virions. The ratio of viral DNA
to protein concentration in purified ΔAd.AAV1 particles is comparable to that obtained
from Ad.AAV1 particles. This indicates that despite the smaller size, only one
ΔAd.AAV1 genome is packaged, resulting in particles with a lighter buoyant density
(~1.32 g/cm³). Electron microscopy demonstrates the icosahedral shape of ΔAd.AAV1
particles (Figure 2). Staining with uranyl acetate causes the central viral cores to appear
electron dense. ΔAd.AAV1 virions have only a spotted luminal dark staining, as expected
with only one 5.5 kb genome being packaged per capsid.



B. In vitro ΔAd.AAV1 production:

Characteristics of deleted adeno-AAV vectors (ΔAd.AAV):

A number of experiments to clarify the mechanisms of ΔAd.AAV genome formation are
carried out. Specifically, the presence of two intact AAV ITRs flanking a reporter gene
cassette is required for the effective formation of ΔAd.AAV genomes. This process does
not occur with partially deleted ITRs or oligo-dC and oligo-dG stretches flanking the
expression cassette. Furthermore, in vitro transduction studies are performed with
different genome titers of ΔAd.AAV1, Ad.AAV1, and Ad.AAV1-Δ2ITRs (lacking the
two AAV ITRs) to determine the number of G418 resistant colonies that form after
4 weeks of selection (Table I).

ΔAd.AAV1 is routinely produced at a high titer (5x10^12 genomes per ml with >10^4
produced genomes per 293 cell) and at a high purity with less than 0.1% contaminating
full-length Ad.AAV1 genomes by a technique normally used for amplification and
purification of recombinant adenovirus.

In vitro transduction studies with hybrid vectors on CD34+ cells and erythroleukemia
cells:

In order to test whether the hybrid vectors allow for gene transfer into cell types that
have to be targeted for sickle cell therapy, infection/transduction studies are performed
using CD34+ enriched human bone marrow cells derived from mobilized peripheral
blood and the human erythroleukemia cell line K562, which expresses ε- and γ-globin genes.

METHODS

Cell Culture:

SKHep1 cells (HTB-52, American Type Culture Collection, Rockville, MD), an
endothelial cell line derived from human liver [Heffelfinger, S.C., et al., 1992, In Vitro
Cell. Dev. Biol. 28A, 136-142], are grown in high-glucose Dulbecco's modified Eagle
medium with 10% fetal calf serum. SKHep1 cells are analyzed for integrated AAV
provirus by Southern analysis of genomic DNA using the AAV1 wild type genome
obtained from pAAV/Ad (Samulski, R. J., et al. 1989. Journal of Virology 63:3822-3828)
(gift from David Russell, University of Washington) as a probe. No specific bands are
detected in undigested genomic SKHep1 DNA or after digestion with HindIII. For viral
infection, confluent cells are incubated with different viral doses for 2 hours, followed by
intensive washing. For G418 selection, 24 h after infection with ΔAd.AAV1, SKHep1
cells are trypsinized and plated at different dilutions under G418 selection (900 µg/ml
active compound, Boehringer-Mannheim, Germany). G418 containing culture medium is
changed every 3 days. The number of colonies with >10^4 cells is counted after 4 weeks
of selection and divided by the number of initially seeded cells. This ratio is used to
express the integration frequency of ΔAd.AAV1. Single clones are obtained by limiting
dilutions of infected cells in 96 well plates. Colonies are expanded to 1x10^6 cells in the
presence of G418. Immunofluorescence analysis for adenoviral proteins expressed in
SKHep1 cells 3 days post-infection is performed as described earlier [Lieber, A., et al.,
1996, J. of Virology, 70, 8782-8791].
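
The colony-based readout described above (colonies counted after selection divided by cells initially seeded) can be sketched in Python as follows. The replicate counts are hypothetical placeholders chosen only to show the arithmetic and are not data from these studies. The SEM is computed with the usual sample standard deviation divided by the square root of the number of replicates, which is an assumption about how the reported SEM values were obtained.

```python
from math import sqrt
from statistics import mean, stdev


def colony_formation_percent(colonies: int, cells_seeded: int) -> float:
    """G418 resistant colonies as a percentage of the cells initially seeded."""
    if cells_seeded <= 0:
        raise ValueError("cells_seeded must be positive")
    return 100.0 * colonies / cells_seeded


def mean_and_sem(values):
    """Mean and standard error of the mean for replicate measurements."""
    return mean(values), stdev(values) / sqrt(len(values))


if __name__ == "__main__":
    # Hypothetical (colonies, cells seeded) pairs for three replicate platings.
    replicates = [(40, 100), (48, 100), (56, 100)]
    percents = [colony_formation_percent(c, n) for c, n in replicates]
    avg, sem = mean_and_sem(percents)
    print(f"colony formation: {avg:.1f}% (SEM {sem:.1f}, N = {len(percents)})")
```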

RESULTS 

293 cells are infected with the first generation vector Ad.AAV1. During replication of
Ad.AAV1, the small ΔAd.AAV1 genome forms spontaneously and is packaged into
adenovirus capsids. At 36 hours after infection, cells are harvested and virus is released by
several cycles of freeze/thawing. The mixture of Ad.AAV1 and ΔAd.AAV1 particles in
the cell lysate is then separated by ultracentrifugation in a CsCl step gradient. Due to its
lighter buoyant density, the band containing the ΔAd.AAV1 particles is clearly separated
(0.8 cm distance) from the band containing full-length virus (Lieber, A., et al. 1999. J
Virol 73:9314-24). ΔAd.AAV1 is purified further by an additional CsCl equilibrium
gradient and is stored in 10 mM Tris pH 7.5, 10% glycerol, 1 mM MgCl2 at -80°C. In total,
the production of 2x10^13 (genome) particles of ΔAd.AAV1 requires less than 3 hours of
work. All functions for ΔAd.AAV1 replication and particle formation are provided from
Ad.AAV1 genomes amplified in the same cell. The efficiency of vector production
measured on a genome-per-cell basis is comparable to or higher than that of labor-intensive,
newer techniques for rAAV production, which have not yet been proven to be reliable. The
estimated ratio of transducing/genome titer for ΔAd.AAV1 is 1:200 (based on SEAP
expression at day 3 post-infection), whereas for the average rAAV preparation, it is in the
range of 1:10^3 to 1:10^4.

1x10^5 confluent SKHep1 cells are infected with different MOIs
of rAAV1 (stock: 1x10^10 genomes per ml), ΔAd.AAV1 (stock: 5x10^12 genomes per ml),
Ad.AAV1 (stock: 1x10^13 genomes per ml), and Ad.AAV1-Δ2ITRs (stock: 9x10^12 genomes
per ml) in a volume of 100 ml. 24 hours after infection, cells are washed, trypsinized, and
plated at different dilutions. G418 is added 24 hours after plating and selection is
performed for 4 weeks. G418 resistant colonies contain on average >5x10^4 cells (at least
16 cell divisions). A significant number of small colonies visible at 2 weeks post-infection
do not survive continued selection, probably due to episomal vector expression.
Cells infected with first-generation adenoviruses at MOIs greater than 1x10^4 develop
CPE during the first week of selection. The rAAV titer is not high enough to perform
infection studies with MOIs greater than 10^4. The colony formation is expressed as the
percentage of the number of colonies after selection relative to the number of cells initially
seeded for selection (Table I).

TABLE I

Formation of G418 resistant colonies after infection with hybrid viruses in
comparison with rAAV.

MOI            Formation of G418 resistant colonies in % (SEM)
(genomes       (after 4 weeks of selection)
per cell)      rAAV1         ΔAd.AAV1      Ad.AAV1       Ad.AAV1-Δ2ITRs

10^1           0             0             0             0
10^2           0             0             0             0
10^3           2.7 (1.6)     1.3 (1)       5.4 (3.0)     0
10^4           90.8 (7.0)    48.0 (8.9)    12.9 (7.2)    0
10^5           N/A           93.1 (5.4)    3.8 (2.1)     0
10^6           N/A           100           0             0
10^7           N/A           100           0             0

N = 3 (SEM is indicated in parentheses.)



K562 cells are infected with different MOIs of ΔAd.AAVBG (genomes per cell):

Three days after infection, the total number of viable cells (based on Trypan blue 
staining) and the percentage of infected cells (based on X-Gal staining) are determined 
for all MOIs. The results are presented in Figure 4A. 

Initial integration studies: 

K562 cells are incubated with ΔAd.AAVSNori at an MOI of 2x10^5 genomes per cell and
the colonies that form after 4 weeks of G418 selection are counted in 96 well plates.
G418 resistant colonies contain on average >5x10^4 cells, which means that the original
cell underwent at least 16 cell divisions.
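
The link between colony size and the number of cell divisions can be verified with a small Python sketch. It assumes that every colony arises from a single cell and that each division doubles the cell number with no cell loss; the function name is hypothetical.

```python
import math


def min_divisions_for_colony(cells_in_colony: float) -> int:
    """Smallest number of doublings that takes one cell to at least the given colony size."""
    if cells_in_colony < 1:
        raise ValueError("a colony contains at least one cell")
    return math.ceil(math.log2(cells_in_colony))


if __name__ == "__main__":
    colony_size = 5e4  # average colony size stated in the text (>5x10^4 cells)
    print(min_divisions_for_colony(colony_size), "divisions")
```

Fifteen doublings give 32,768 cells and sixteen give 65,536, so a colony of more than 5x10^4 cells implies at least 16 divisions under these assumptions.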

Infection studies with ΔAd.AAVBG (1-10^8 genomes per cell) on CD34+ cells:

Infection of CD34+ cells is performed as described for K562 cells. CD34+ cells are cultured in
IMDM supplemented with 20% FCS, kit ligand (stem cell factor, SCF) (100 ng/ml), and
IL-3 (100 ng/ml). Since a number of reports suggest that specific cytokines like GM-CSF
or M-CSF, which induce stem cell differentiation, can stimulate integrin expression and
may therefore affect internalization of Ad5 vectors, infection rates are compared with
Ad5 based hybrid vectors on CD34+ cells cultured with and without pre-stimulation with
GM-CSF (50 ng/ml) or M-CSF (50 U/ml). The number of infected cells is counted based
on X-Gal staining at day 3 after infection. To test for dose-dependent toxicity, viable
cells are counted (based on trypan blue exclusion) at day 3 post-infection. Furthermore,
whether high viral doses affect the ability of CD34+ cells to differentiate in methyl
cellulose colony assays in the presence of IL-3 and SCF is analyzed. The results are
expressed as viable cells/X-Gal positive cells vs MOI (see Figure 4B).

DISCUSSION 

The above data demonstrate that ΔAd.AAV stably transduces an immortalized human
cell line with a low frequency comparable to rAAV; however, transduction rates could be
scaled up to 100% by using greater MOIs of ΔAd.AAV1, which is produced at higher
titers than rAAV1. In contrast to infection with the first-generation vector, Ad.AAV1,
infection with ΔAd.AAV1 is not associated with dose-dependent cytotoxicity because no
viral proteins are expressed from these vectors in transduced cells. Furthermore, viral
proteins present in the incoming ΔAd.AAV1 particles are not problematic in the dose
range used. The comparison of transduction rates of ΔAd.AAV1/Ad.AAV1 with the vector
lacking AAV ITRs, Ad.AAV1-Δ2ITRs, supports the hypothesis that the presence of two
intact AAV ITRs is crucial for hybrid vector integration.

The data demonstrate that the leukemia cell line can be infected at ~90% efficiency with
Ad5 based hybrid vectors at an MOI of 2x10^5 genomes per cell without significant toxic side
effects. However, this dose is still ~100 times greater than the dose necessary to infect
100% of HeLa cells, hepatoma cells, primary hepatocytes, and other cell lines generally
considered permissive for Ad5 vector infection.

Since viral DNA in cells infected with 2x10^4 genomes (or 100 transducing particles) per
cell should be lost after 7 cell divisions, the presence of G418 resistant cells in the
observed colonies suggests that ΔAd.AAVSNori genomes are integrated into or stably
associated with the host genome. Based on the number of G418 resistant colonies, one
out of 25,000 ΔAd.AAVSNori genomes integrates stably into K562 cells. This is in
agreement with the results obtained earlier with ΔAd.AAV1 in SKHep1 cells.
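
The dilution argument can be made explicit with a minimal Python sketch. It assumes the genome-to-transducing-particle ratio of about 200:1 given earlier, and it further assumes that non-integrated vector genomes do not replicate and are split evenly between daughter cells at each division. Both the model and the function names are illustrative simplifications, not a description of the actual episome kinetics.

```python
def transducing_particles(genomes_per_cell: float, genomes_per_transducing_unit: float = 200.0) -> float:
    """Convert a genome dose per cell into transducing particles per cell."""
    return genomes_per_cell / genomes_per_transducing_unit


def divisions_until_below_one_copy(copies_per_cell: float) -> int:
    """Count halvings until the average episomal copy number per cell drops below one."""
    divisions = 0
    while copies_per_cell >= 1.0:
        copies_per_cell /= 2.0  # even partitioning, no episome replication (assumption)
        divisions += 1
    return divisions


if __name__ == "__main__":
    start = transducing_particles(2e4)
    print(start, "transducing particles per cell")
    print(divisions_until_below_one_copy(start), "divisions until <1 copy per cell on average")
```

Under these assumptions, 100 copies fall to about 0.78 copies per cell after 7 divisions, whereas the G418 resistant colonies have passed through at least 16 divisions.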



The maximal dose used for infection of CD34+ cells (1x10^8) results in X-Gal staining of
only ~10% of cells independently of GM-CSF/M-CSF. This demonstrates the obvious
inability of Ad5 to infect CD34+ cells and is probably caused by the absence of specific
receptors and/or integrins on the cell surface. CD34+ cells tolerate a large range of viral
doses (1-10^7) without obvious effects on cell viability and total cell number. This is not
surprising because, in order to develop toxic side effects, adenovirus has to enter the cell
and express viral genes. Hybrid vectors can be produced at titers of 5x10^12 genomes per
ml. Thus, the maximal MOI that can be used for infection (of 10^4 cells) is ~5x10^8 (in
100 µl storage buffer). Based on the infection studies with ΔAd.AAVBG, this dose may
not be sufficient to efficiently transduce CD34+ cells and to obtain an appreciable
number of G418 resistant colonies.

C. In Vivo Properties Of ΔAd.AAV1:

Viral DNA is labeled with BrdU during virus amplification to investigate cellular/nuclear
vector uptake in situ. For transduction studies, confluent SKHep1 cells (a human
endothelial cell line) are infected with 2000 genomes of ΔAd.AAV1 or Ad.AAV1 per cell.
BrdU tagged viral DNA is detected in 100% of nuclei at 3 hours post-infection for both
viruses, indicating efficient cellular and nuclear uptake of hybrid virus DNA.

RESULTS 

The ΔAd.AAV1 vector transduces cells in vitro, forming G418 resistant colonies with an
efficiency of 17 or 58% after infection with an MOI of 1x10^3 or 1x10^4 genomes per cell,
respectively. Approximately 2x10^4 ΔAd.AAV1 genomes are required to yield one stable
transfectant. Since all stable colonies contain integrated ΔAd.AAV1 vector DNA, this
number reflects the minimal integration frequency of ΔAd.AAV1 in SKHep1 cells, which
is comparable with that of rAAV (Rutledge, E. A. et al., 1997, Journal of Virology,
71:8429-36). The number of G418 resistant colonies does not necessarily represent the
total frequency of integration events because not all integrated copies express neomycin
phosphotransferase, due to chromosomal position effects or incomplete integration.

The absence of adenoviral gene products in ΔAd.AAV1 transduced cells at day 3
post-infection is demonstrated by immunofluorescence with antibodies to the major late
proteins (hexon, fiber) and early proteins (DBP, E4-orf6). Expressed adenoviral proteins
are detected only in cells infected with Ad.AAV1. The fact that cells infected with
ΔAd.AAV1 do not express potentially cytotoxic adenoviral proteins is important.

While an MOI of 1x10^4 genomes per cell of the first generation vector Ad.AAV1 induces
cytopathic effects in SKHep1 cells at day 3 p.i., no toxic side effects are observed when
SKHep1 cells are infected with ΔAd.AAV1 at a dose of up to 1x10 genomes per cell.
Since the transduction efficiency is clearly dose dependent, ΔAd.AAV1 (which can be
produced at titers of >5x10^12 genomes/ml) is able to stably transduce Ad5 permissive cell
lines or tissues with 100% efficiency without associated toxicity.

Southern analysis indicates that ΔAd.AAV1 integrates randomly as head-to-tail tandem
repeats into the host cell genome via the right AAV ITR, whereas the other junction with
the chromosomal DNA is variable and occurs somewhere within the transgene cassette.
In order to confirm the integrated status of ΔAd.AAV1 DNA, high-molecular-weight
chromosomal DNA is separated by pulsed-field gel electrophoresis (PFGE), followed by
Southern analysis with a SEAP specific probe (Figure 3). Undigested DNA from control
SKHep1 cells gives an endogenous SEAP signal that co-migrates with chromosomal DNA
just below the well (lanes 1 and 5). No high-molecular-weight episomal forms of
ΔAd.AAV1 DNA are detected, whereas a distinct 35 kb band is visible in DNA from
SKHep1 cells isolated 3 days after infection with first generation adenovirus, Ad.AAV1
(lanes 4 and 13). Digestion with EcoRI reveals the 4.4 kb fragment, which is specific for
integrated tandem copies of the AAV cassette (lanes 8 and 12). To eliminate the
possibility that chromosomal DNA is trapped in the well, DNA samples are digested with
the intron-encoded endonucleases PI-SceI or I-CeuI (Gibco-BRL, Grand Island, NY) with a
sequence specificity of more than 11 bp or 9 bp, respectively. Digestion with PI-SceI
yields a >2 Mb endogenous SEAP signal in SKHep1 cells (lane 2) and an additional signal
in the range of ~1 Mb in G418 resistant colonies transduced with ΔAd.AAV1 (lane 7).
I-CeuI digestion results in a smear between 250-1000 kb in ΔAd.AAV1 transduced
SKHep1 cells (lanes 10, 11), indicating random integration, whereas a high-molecular-weight
band specific for the endogenous SEAP gene is observed in control SKHep1 cells
(lane 9).

One day after intraportal infusion of 1x10^12 ΔAd.AAV1 genomes in C57Bl/6 mice, BrdU
labeled vector genomes can be detected in 85% of hepatocytes (Lieber, A., et al. 1999. J
Virol 73:9314-24). Hepatocellular DNA analysis performed at 2 months post-infusion
reveals ΔAd.AAV1 DNA integrated with an average of 0.5 copies per cell into the mouse
genome (Lieber, A., et al. 1999. J Virol 73:9314-24). To assess potential side effects of
intraportal ΔAd.AAV1 infusion, serum glutamic pyruvic transaminase (SGPT), a
sensitive marker for hepatocellular injury, is measured for 7 consecutive days
post-infusion in combination with histological analysis of liver sections. No significant
elevation in SGPT levels or histological abnormalities are detected after intraportal
infusion of 1x10^12 or 1x10^13 ΔAd.AAV1 genomes, whereas infusion of the same dose of
full-length Ad.AAV1 vector is associated with severe hepatotoxicity or fatal outcome. This
suggests that the dose of ΔAd.AAV1 administered to mice can be increased to obtain
higher transduction efficiencies in vivo without adverse side effects, which is not possible
for first generation adenoviruses. Importantly, ΔAd.AAV1 transduced quiescent
hepatocytes in vivo, which suggests that integration of hybrid vector DNA may not
require cell proliferation. Recently, more detailed in vivo transduction studies with
Ad.AAV1 and ΔAd.AAV1 have been performed in Balb/c mice to study whether the
absence of adenoviral gene expression in cells infected with ΔAd.AAV1 can avoid an
anti-viral immune response and can prolong vector persistence. In this mouse strain,
vector DNA is cleared from the liver at 4-6 weeks after infusion with first generation
adenoviruses, mostly due to a CTL response against viral proteins produced in transduced
cells. Vector DNA is analyzed by genomic Southern Blot of hepatic DNA at 12 weeks
after infusion of 1x10^12 genomes of Ad.AAV1 or ΔAd.AAV1. At this time point, no vector
specific signal is detectable in hepatic DNA from mice infused with the first generation
vector Ad.AAV1, while ~0.3 copies of ΔAd.AAV1 genomes per cell are present in livers
of mice that received the hybrid vector, again indicating the superior in vivo properties of
the hybrid vector.

D. Effects of Rep Coexpression on ΔAd.AAV Integration

Rep expression after plasmid transfection:

In order to test whether Rep expression enhances site-specific integration of ΔAd.AAV1
in human cells, a series of Rep expression plasmids are constructed.

METHODS 

The Rep ORF 68/78 (nt 285-2313), including the internal p19 and p40 promoters, is
obtained from pAAV/Ad (Samulski, R. J. et al., 1991, In B. N. Fields, et al. (eds.),
Fields Virology, vol. 2 Lippincott-Raven Publisher, Philadelphia) by digestion with
BsaI/BsrI. This fragment, deleted for the AAV p5 promoter, is cloned via adapter linkers
under the RSV or PGK promoter in front of the bovine growth hormone polyadenylation
signal (bPA) into pAd.RSV or pAd.PGK (Lieber, A., and Kay, M.A., 1996, J. of
Virology, 70, 3153-3158; Lieber, A., et al., 1995, Human Gene Therapy, 6, 5-11),
respectively.

RESULTS 

When the resulting plasmids (pRSVrep, pPGKrep) are transfected into 293 cells or SKHep1
cells, most of the Rep proteins expressed from the heterologous promoters (RSV or PGK)
are Rep 68 and Rep 78, while transfection of the rep gene under the AAV p5 promoter
(pAAV/Ad) results in predominant Rep 52/40 expression. Thus, transfection of pRSVrep
and pPGKrep is more pronounced, suggesting a strong transactivation of AAV promoters
by E1a, which is produced in 293 cells. This result indicates that minimum expression of
Rep proteins is necessary to avoid interference with adenovirus replication.

Rep-mediated site-specific integration of ΔAd.AAV1.

The potential for site-specific integration is an important characteristic of the novel
Ad.AAV vectors of the invention. In an embodiment of the invention, integration of the
ΔAd.AAV is directed by co-infection with Ad.AAV expressing the Rep 78 protein to
achieve site-specific integration in the AAVS1 site on human chromosome 19. For this
type of site-specific integration to occur in cells other than 293 cells, E4 ORF6
expression is required. The co-infection of ΔAd.AAV, ΔAd.rep78, and ΔAd.E4-orf6
allows for site-specific integration of the ΔAd.AAV transgene cassette. The ΔAd.rep78
and the ΔAd.E4-orf6 genomes are degraded soon after transduction, thus avoiding
potential side effects. Site-specific integration is preferred over random integration,
which is seen with rAAV and ΔAd.AAV, in order to reduce the risk of insertional
mutagenesis.

A preliminary test can be performed to confirm the functional activity of Rep 68/78
expressed from pRSVrep to mediate site-specific integration of ΔAd.AAV1 (Figures 5 and
6). Human SKHep1 cells are transfected with pRSVrep or control plasmid (pRSVbGal;
Lieber, A., et al., 1995, Human Gene Therapy, 6, 5-11) (transfection efficiency was
~20%), followed by infection with ΔAd.AAV (2000 genomes per cell). Three days after
infection, cells are trypsinized, embedded in agarose, lysed in situ, digested with I-CeuI
(an intron-encoded endonuclease with a recognition sequence of more than 10 nt),
subjected to pulsed-field gel electrophoresis in a 1% agarose gel, and analyzed by Southern
Blot. Hybridization with a probe covering the AAVS1 integration site (1.7 kb
EcoRI/BamHI fragment from the chromosome 19 locus (Samulski, R. J. et al., 1991, In
B. N. Fields, et al. (eds.), Fields Virology, vol. 2 Lippincott-Raven Publisher,
Philadelphia)) reveals an AAVS1-specific band (~240 kb) in I-CeuI digested DNA from
cells after control plasmid transfection (pCo) + ΔAd.AAV1 infection. An additional
signal in the range of 280 kb appears in rep expressing cells infected with ΔAd.AAV1
(pRSVrep + ΔAd.AAV1), indicating a site-specific insertion into the AAVS1 site in a
certain percentage of cells. The presence of vector DNA in this 280 kb band is confirmed
by rehybridization of the same filter with a transgene (SEAP) specific probe. Randomly
integrated ΔAd.AAV1 vector appears as a diffuse SEAP signal in the range 280-680 kb
(pCo+ΔAd.AAV1, pRSVrep+ΔAd.AAV1). The specific ~1.9 Mb band on blots
hybridized with the SEAP probe represents an I-CeuI fragment containing the
endogenous human SEAP gene.

Incorporation of Rep 68/78 function into hybrid vectors to stimulate site-specific integration

Rep overexpression inhibits adenovirus DNA replication, prohibiting the generation of
rep expressing Ad vectors using conventional strategies. To solve this problem,
significant Rep 68/78 expression from the hybrid vector in virus producer (293) cells
must be prevented while maintaining transient Rep expression in target cells (HSC) to
mediate site-specific integration. Our hypothesis is that the specific structure of the
ΔAd.AAV hybrid virus can be used to bring the rep 68/78 gene into a transcriptionally
active position under control of an HSC specific promoter only at late stages of virus
replication in 293 cells. This will allow amplification of the hybrid vector in 293 cells,
generating high titer virus which activates the incorporated Rep 68/78 functions only in
HSC. The general outline of our strategy to produce Rep expressing hybrid vectors is
illustrated in Figure 7. The rep/transgene cassette is assembled based on the left-hand
shuttle plasmid used for recombinant adenovirus production. The gene encoding Rep
68/78 is cloned in 3'->5' orientation in front of a transgene expression cassette flanked by
AAV ITRs. Between the transgene cassette and the right AAV ITR, an HSC-specific
promoter is inserted with direction towards the adenoviral E2, E3, and E4 genes. The
recombinant genome is produced by recombination in E. coli, and transfection into 293
cells generates virus (Ad.AAV-rep). The specific structure of ΔAd.AAV, with duplicated
sequences flanking the AAV ITRs, is used to bring the rep gene into a transcriptionally
active position under control of an HSC specific promoter only during late stages of viral
DNA replication in 293 cells. During amplification of Ad.AAV-rep, the smaller genome
ΔAd.AAV-rep is formed and packaged into particles, which can be separated by
ultracentrifugation in CsCl gradients. The specific structure of ΔAd.AAV-rep brings the
rep gene into 5'->3' orientation in relation to the HSC specific promoter, allowing rep
transcription in target cells. After transduction of HSC with purified ΔAd.AAV-rep
particles, rep expression is activated and mediates rescue of the AAV-ITR/transgene
cassette from the adenoviral vector backbone and site-specific integration. The
hypothesis is that Rep-mediated integration into AAVS1 occurs via the right or both
AAV ITRs, causing the rep gene to become separated from the hepatocyte-specific
promoter once the vector is integrated (Figure 7). Therefore, rep expression should be
only transient, without critical cytotoxic side effects on the host cell.

Promoters that can regulate rep expression:

Potential candidate promoters to drive rep expression with high specificity for HSC and
minimal activity in 293 cells are the 454 nt CD34 promoter (Krause, D.S., et al., 1997,
Experimental Hematology, 25, 1051-1061; Yamaguchia, Y. et al., 1997, Biochimica et
Biophysica Acta., 1350:141-6), the 300 nt HS 40 enhancer (Chen, H.L., et al., 1997,
Nucleic Acids Res. 25, 2917-2922) or a 3 kb CD34 enhancer (May, G. et al., 1995, EMBO
J., 14:564-74) in combination with an initiator, or the HIV LTR. An optimal promoter is
selected based on studies of transient reporter gene expression after plasmid transfection
in 293 cells and hepatocytes. All promoters to be tested are cloned in front of the human
α1-antitrypsin (hAAT)-bovine growth hormone polyadenylation signal (bPA) into the
adenoviral shuttle plasmid pCD2 (pAd.-hAAT). Promoter activity can be tested in
transient plasmid transfection assays in CD34+ and 293 cells. The promoter with the
highest hAAT levels in CD34+ or K562 cells and the lowest hAAT expression in 293
cells is selected for further studies. If high background expression in 293 cells from these
promoters is seen, insulators can be utilized to shield the HSC-specific promoters from the
E1a enhancer, which is still present in Ad shuttle plasmids.

Rep genes:

The large Rep 68/78 proteins are sufficient to mediate rescue and site-specific
integration. Unregulated Rep 52 and Rep 40 expression from the AAV p19 promoter
located within the ORF of Rep 68 and 78 must be prevented because production of these
smaller Rep proteins in 293 cells will affect cell viability and adenoviral DNA synthesis.
To do this, constructs obtained from Surosky et al., containing a mutated Rep 52/40 start
codon to express Rep 68 and 78 individually under the CMV promoter, can be used. The 293
cells transiently expressing Rep 68 or Rep 78 from these constructs can be coinfected with
ΔAd.AAV1 (infection 24 hours after pCMVRep transfection, MOI 2x10^5 genomes/cell).
Three days after ΔAd.AAV1 infection, cellular DNA is analyzed for AAVS1-specific
integration events by PCR and PFGE as described earlier. Efficient Rep mediated
excision of the AAV cassette and site-specific integration without flanking adenoviral
sequences are expected, and the plasmids pCMVRep68 or pCMVRep78 can be used as a
source for the corresponding rep genes to clone them into hybrid vectors.

Vectors: 

The rep/transgene cassette can be assembled based on pXCJL (Microbix, Toronto). A set
of control hybrid vectors can be generated with the AAV-ITR-transgene cassette only,
without the rep gene. The recombinant Ad.AAV-rep genome can be generated by
recombination of the left hand shuttle plasmids with pCD1, a pBHG10 (Microbix,
Toronto) derivative which contains the Ad5 genome deleted for the E1/E3 regions, in
recA+ E. coli (Chartier, C., et al., 1996, J. of Virology, 70, 4805-4810). Compared to the
standard technique based on plasmid recombination in 293 cells, this approach has the
advantage that plaques with recombinant virus appear 3 times faster and the production
of illegitimate recombinants is minimized. This allows efficient viral DNA amplification
and packaging to occur before Rep expression reaches levels that are potentially
inhibitory for adenoviral replication. The critical variables in maximizing the output of
the vector deleted for all adenoviral genes are the initial multiplicity of infection and
the time of harvesting. These parameters can be optimized for production of ΔAd.AAV-rep
hybrid vectors. A number of ΔAd.AAV vectors can be constructed incorporating the rep
gene. Cryptic promoter and enhancer elements present in the 5' 342 nt of the adenoviral
genome can interfere with transgene expression from the heterologous promoters. This is
crucial for the strategy to avoid rep expression from ΔAd.AAV-rep genomes in 293 cells.
To ensure efficient transgene expression, insulator fragments such as the chicken beta-
globin insulator can be used with a selected promoter, constitutive or inducible.

Rep protein co-packaging: 



As an alternative to producing hybrid vectors containing the rep 68/78 gene, studies are
designed to see whether Rep protein can be co-packaged into ΔAd.AAV capsids and
whether these co-packaged Rep molecules are sufficient to mediate rescue and site-
specific integration of the AAV-ITR-transgene cassette. Our hypothesis is that
Rep 68/78 binds to the Rep binding site (RBS) present in the double-stranded ΔAd.AAV
genome and that this complex is co-packaged into adenoviral capsids, which are spacious
enough to accommodate extra proteins. Protein/DNA ratio analysis performed
previously in purified particles indicates that only one 5.5 kb ΔAd.AAV1 genome is
packaged per capsid. This is confirmed by electron microscopy of ΔAd.AAV1 particles,
which reveals only spotted electron-dense staining associated with viral cores and
extended free luminal space (see Figure 2).

293 cells are transfected with plasmids expressing Rep 68/78 under the CMV promoter,
and the kinetics of rep expression is determined by Western Blot with cell lysates
collected at different time points after transfection. Next, these 293 cells are infected
with Ad.AAV (MOI 1, 10, 100 pfu/cell) at specific time points after transfection of Rep
plasmids depending on the Rep expression kinetics (e.g. 3, 6, 12, 24 ... hours after
transfection). It is important to time Ad.AAV infection exactly because viral DNA
replication must be taking place or finished before Rep production reaches peak levels.
In general, adenovirus DNA replication in 293 cells (infected with MOI 10) is maximal at
18 hours post-infection, followed by production of structural proteins, packaging of viral
genomes, and breakdown of cellular membrane structures (which is concluded ~36-48h
p.i.) (Shenk, T., 1996, In B. N. Fields, et al. (eds.), Fields Virology, vol. 2 Lippincott-
Raven Publisher, Philadelphia; van der Vliet, B., 1995, In W. Doerfler, et al. (eds.), vol. 2,
p. 1-31, Springer-Verlag, Berlin). Viruses are collected 48h after infection and banded
by CsCl ultracentrifugation. Viral material from purified bands corresponding to
ΔAd.AAV is lysed, DNase-treated (to liberate DNA-associated Rep) and subjected to
immunoprecipitation-Western Blot with Rep specific antibodies to detect co-packaged
Rep. Based on theoretical calculations assuming that two Rep molecules bind per Ad
genome, ~1-10 ng of Rep protein is expected from lysates of 10^10 particles, which is within
the range of detectability by Western Blot. Alternatively, co-packaged Rep may be
detected based on its functional activity to mediate rescue and site-specific integration of
the AAV-ITR-transgene cassette. To test whether functional Rep protein is co-packaged
into hybrid vector particles, CsCl purified ΔAd.AAV1 particles generated in 293 cells co-
expressing Rep after Ad.AAV1 infection (ΔAd.AAV1+Rep) can be used for transduction
studies. Three days after ΔAd.AAV1+Rep infection of the human cell line K562, cellular
DNA is analyzed for AAVS1-specific integration events by PCR and PFGE. If efficient
Rep-mediated site-specific integration of excised AAV cassettes is successful, then other
ΔAd.AAV+Rep hybrid vectors with β-Gal and SNori as transgenes can be produced.
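
The theoretical calculation mentioned above (two Rep molecules per genome, 10^10 particles)
can be reproduced with a short Python sketch. The two-molecules-per-genome figure and the
one-genome-per-capsid figure come from the text; the molecular mass of 78 kDa is an
assumption taken from the nominal name of Rep 78, and the function name is illustrative
only.

```python
AVOGADRO = 6.02214076e23  # molecules per mole


def rep_mass_ng(particles: float,
                rep_per_genome: int = 2,        # stated in the text
                genomes_per_particle: int = 1,  # one genome per capsid, per the text
                rep_mw_da: float = 78_000.0     # ASSUMPTION: nominal mass of Rep 78
                ) -> float:
    """Expected mass of co-packaged Rep protein, in nanograms."""
    molecules = particles * genomes_per_particle * rep_per_genome
    grams = molecules / AVOGADRO * rep_mw_da
    return grams * 1e9


print(f"{rep_mass_ng(1e10):.2f} ng Rep expected from 1e10 particles")
```

Under these assumptions the estimate is roughly 2.6 ng, which falls inside the ~1-10 ng
range given above.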

Integration studies with Rep vectors in erythroid cells:

The hypotheses behind the rationale of a rep-expressing hybrid vector (ΔAd.AAV-rep)
are: (1) transient Rep co-expression from ΔAd.AAV-rep vectors can enhance site-
specific vector integration in human cells and (2) integration occurs via the AAV ITR(s)
without the rep gene, which is placed outside the AAV cassette, thus eliminating rep
expression upon vector integration. To test hypothesis 1, transduction frequencies of
ΔAd.AAV/rep versus ΔAd.AAV vectors can be compared based on the formation of
G418 resistant colonies, and site-specific integration events can be quantified at different
time points after infection of human and mouse cells by PFGE and PCR. To test hypothesis 2,
the structure of integrated vector in transduced cell populations and single clones can be
delineated by Southern analysis and by sequencing of vector/chromosomal DNA
junctions. These studies can be performed with ΔAd.AAV-rep, ΔAd.AAV, and
ΔAd.AAV+Rep (copackaged protein) in human K562 or HEL (for AAVS1 integration)
and mouse MEL cell lines.

Cells infected with ΔAd.AAV-SNori, ΔAd.AAV-SNori+Rep or ΔAd.AAV-SNori-rep can
be subjected to G418 selection. The number of G418 resistant colonies is determined after
4 weeks of selection in relation to the number of initially infected cells. The selection
process can be monitored for colonies that do not survive continued selection due to
potential rep-mediated cytotoxicity or episomal vector expression. If rep expression from
ΔAd.AAV-SNori-rep does not affect cell viability and proliferation, then more G418
resistant colonies should appear with ΔAd.AAV-SNori-rep and ΔAd.AAV-SNori+Rep. The
structure of integrated vector can be determined by Southern Blot and sequencing of
integration junctions.

To uncover a potential selection bias against rep producing cells after transduction with
ΔAd.AAV/rep, site-specific and random vector integration events can be quantitated in
cellular DNA isolated from cell populations at different time points after infection (e.g.
0.5, 1, 3, 7, 14 days). To do this, the techniques based on PFGE-Southern can be utilized.
It is expected that the signal(s) for AAVS1-specific integration in ΔAd.AAV/rep infected
human cells increases during the first days after infection and then remains constant over
time.



In a separate study, the integration status of vector DNA (analyzed by PFGE or PCR) and
the number of integrated copies (analyzed by Southern Blot) can be correlated with the
expression level of β-galactosidase in single clones transduced with β-Gal hybrid vectors
(ΔAd.AAV-BG, ΔAd.AAV-BG+Rep, or ΔAd.AAV-BG-rep). Together with data
obtained in the studies described in the Specification, this allows assessment of whether
transcriptional silencing is associated with site-specific vector integration into the
AAVS1 site.

It is not clear a priori whether the specific Rep function for vector rescue,
concatemerization, and integration can efficiently occur in non-S-phase or non-dividing
cells. To test whether ΔAd.AAV*, ΔAd.AAV or ΔAd.AAV-rep/+Rep vectors can
integrate into non-dividing cells, transduction studies in cell cycle arrested cell cultures
can be performed as described earlier.

DISCUSSION

The establishment of stable cell lines expressing Rep 68/78 at detectable levels is not 
possible, which is probably due to rep mediated cytotoxicity. Therefore, it is not possible 
to perform long-term transduction studies (e.g. G418 selection or studies in single clones) 
in combination with ectopic rep expression. Moreover, due to the inhibitory effect of rep 
on adenovirus replication, it is currently not possible to generate adenoviral vectors 
expressing rep under the RSV or PGK promoter. 

Taken together, this indicates that co-expressed Rep may stimulate site-specific transgene 
integration. 

E. A Detailed Study Of Transduction/Integration Of Hybrid Vectors In Erythroid
Cell Lines:

In order to improve transduction and integration frequencies of the hybrid vectors into
erythroid cell lines, a detailed study comparing various hybrid vectors has to be carried
out as described below. The transduction studies are performed in K562 cells, which are
considered to be an adequate model to study gene transfer vehicles into erythroid cells
(Floch, V., et al., 1997, Blood Cells, Mol. and Diseases, 23, 69-87). The optimal vectors
should be able to integrate into the cellular genome with a high frequency, determined by
pulsed-field gel electrophoresis (PFGE) and Southern blot as described in Example 4. In
addition, the results from the following studies will serve to evaluate whether a given
hybrid vector needs to be modified for site-specific integration in the host genome.

Sequencing of integration junctions: 

The ultimate proof for vector integration is the sequencing of junctions between SNori
vector DNA and chromosomal DNA. Furthermore, this clarifies the question whether the
AAV ITRs represent the substrate for integration. Specifically, DNA from clones with
known ΔAd.AAVSNori integration structure (analyzed by Southern Blot) is digested with
EcoRI, which does not cut within the SNori cassette. The resulting fragments are
circularized and transformed into a specific E. coli strain (according to the protocol
described by Rutledge and Russell (Rutledge, E. A. et al., 1997, Journal of Virology,
71:8429-36)). Kanamycin resistant bacterial clones should contain the integrated SNori
cassette. Flanking chromosomal DNA in rescued plasmids can be sequenced with
primers specific to the transgene.

To confirm vector integration in a small number of transduced cells, genomic DNA is
extracted and digested with EcoRI. EcoRI fragments are ligated to linkers containing a
specific primer binding site and are then digested with NotI, religated and propagated in
E. coli. Plasmid DNA from a representative number of bacterial clones is sequenced to
determine the vector/chromosomal DNA junctions.

Dose dependent toxicity:

In order to test whether the transduction frequency is dose dependent and whether ΔAd.AAV
vectors, which are devoid of all adenoviral genes, can be used to infect cells at higher
doses with less cytotoxicity than first generation adenovirus, K562 cells are infected with
different MOIs (1-10^8) of ΔAd.AAVBG and the first generation vector Ad.AAVBG
(which contains the same β-Gal expression cassette). At day 4 post-infection, the total
number of cells, the percentage of viable cells (based on trypan blue exclusion) and the
percentage of X-Gal positive cells are counted. β-Gal expression is quantified in a
fraction of the infected cells using the Galacto-Light kit. The level of transgene expression
is expected to be comparable between the two vectors. K562 cells are predicted to tolerate
higher doses of ΔAd.AAVBG better than Ad.AAVBG, which expresses viral genes.

Integration frequency with and without G418 selection: 

In order to investigate the integration frequency of the different vectors and to confirm
that AAV ITRs present in double-stranded adenoviral DNA genomes can mediate vector
integration with a frequency comparable to rAAV vectors, integration studies are
performed based on the formation of G418 resistant colonies with ΔAd.AAVSNori,
AdSNori, Ad.SNoriITR, and rAAVSNori after infection with 2x10^5 and 2x10^6 genomes
per cell (Fig. 8). After infection, cells are plated in 96 well plates under limiting dilution
and selected with G418 to estimate the frequency of formation of G418 resistant colonies.
Another set of cells is plated without G418. A representative number of clones (with and
without G418 selection) are expanded to >10^6 cells (after 3-4 weeks of culture) and analyzed
for the presence of viral DNA by Southern Blot as well as PFGE analysis to discriminate
between episomal vector DNA and vector genomes stably associated with chromosomal
DNA. This allows us to estimate the integration frequency of the different vectors, to
assess the effect of G418 selection on integration, and to consider position effects on neo
expression in calculating the total integration frequency. Integrated vector copies with a
frequency of at least 1x10^-4 are predicted only for ΔAd.AAVSNori and rAAVSNori. The
total number of colonies may be lower for both of the first generation vectors,
Ad.AAVSNoriITR and Ad.SNori, due to the toxic effects of expressed adenoviral
proteins; however, a higher integration frequency is predicted for the vector containing
the AAV ITRs (Ad.AAVSNoriITR).
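
One common way to turn limiting-dilution plate counts into a colony-forming frequency is a
single-hit Poisson estimate. The text does not state which analysis is used, so the
following Python sketch is an assumption offered for illustration; the function name and
the example plate counts are hypothetical.

```python
import math


def colony_forming_frequency(negative_wells: int, total_wells: int,
                             cells_per_well: float) -> float:
    """Single-hit Poisson estimate of the frequency of colony-forming cells.

    If each plated cell forms a G418 resistant colony with probability f,
    the expected fraction of wells without any colony is exp(-f * cells_per_well).
    """
    if not 0 < negative_wells <= total_wells:
        raise ValueError("need at least one negative well and a valid well count")
    if cells_per_well <= 0:
        raise ValueError("cells_per_well must be positive")
    fraction_negative = negative_wells / total_wells
    return -math.log(fraction_negative) / cells_per_well


# Hypothetical plate: 96 wells, 1e4 infected cells per well, 35 wells without colonies.
freq = colony_forming_frequency(35, 96, 1e4)
print(f"estimated frequency: {freq:.2e} per infected cell")
```

The estimate is undefined when every well contains a colony, which is why several cell
densities are normally plated so that at least one dilution yields negative wells.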

Kinetics of integration: 

Compared to rAAV, the double-stranded nature of entering ΔAd.AAV genomes provides
more protection against degradation. Furthermore, the synthesis of transcriptionally
active double-stranded intermediates from single-stranded genomes, which is considered
a limiting step in rAAV transduction, is not required in ΔAd.AAV transduction. Thus,
the lag phase between infection and expression seen with rAAV vectors, which is
causally linked to double-strand synthesis/integration, may be shorter or absent in
infections with ΔAd.AAV vectors. Furthermore, it was demonstrated earlier that a 9 kb
mini-adenoviral genome packaged into adenoviral particles is only short lived and
completely degraded by day 3 post-infusion. In contrast, transduction with the 5.5 kb
ΔAd.AAV1 (Figure 1) genome allows for long-term expression, suggesting that either
AAV ITRs can stabilize the viral genome as an episome until it is integrated or
integration occurs shortly after infection.

The status of vector DNA can be examined in K562 cells at different time points after
infection with ΔAd.AAVSNori, AdSNori, Ad.AAVSNoriITR, or rAAVSNori (MOI
2x10^5). Infected cells are harvested at 1 hour, 5 hours, 1 day, 3, 7, and 14 days after
infection and chromosomal DNA is analyzed by PFGE followed by hybridization with a
transgene specific probe. This technique allows us to distinguish between episomal
vector DNA, which appears as a distinct 5.0 kb band, and integrated DNA. Furthermore,
extrachromosomal high-molecular-weight vector concatemers can be detected. In the
case of random integration, after digestion of chromosomal DNA with I-CeuI or PI-SceI,
vector-specific signals in the range of 1-2 Mb should be seen. The intensity of the episomal
and integrated vector signals is quantified for each time point using phosphorimager
analysis. This gives information about the kinetics of hybrid vector integration in a
population of infected K562 cells and the intracellular stability of hybrid vector genomes.
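
The per-time-point quantification described above reduces to a simple ratio of band
intensities. The following Python sketch shows one way it could be tabulated; the signal
values are invented placeholders, not data, and the function name is an assumption.

```python
def integrated_fraction(episomal: float, integrated: float) -> float:
    """Fraction of the total vector signal that migrates as integrated DNA."""
    total = episomal + integrated
    if total <= 0:
        raise ValueError("no vector signal detected")
    return integrated / total


# Placeholder phosphorimager signals (arbitrary units), keyed by time after infection.
# Each tuple is (episomal band, integrated high-molecular-weight signal).
signals = {
    "1 h": (980.0, 20.0),
    "1 d": (700.0, 150.0),
    "3 d": (300.0, 240.0),
    "14 d": (10.0, 230.0),
}

for timepoint, (episomal, integrated) in signals.items():
    share = integrated_fraction(episomal, integrated)
    print(f"{timepoint:>5}: {share:.0%} of vector signal integrated")
```

Plotting this fraction together with the total signal over time would separate integration
kinetics from loss of vector genomes through degradation.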

Structure of integrated vector DNA and integration junctions with chromosomal DNA: 

ΔAd.AAV1 integrates as concatemer(s) randomly into host DNA as shown previously.
How many vector copies are present in one concatemer and whether the extent and the
kinetics of tandem formation are dose dependent still remain unclear. Another
unanswered question is how ΔAd.AAV integrates: whether one or both ITRs are
involved, whether the integrated ITRs are still intact, and whether adenoviral sequences
integrate as well. These issues are important for the strategy to include rep genes in the
hybrid vector genome. Moreover, if intact AAV ITRs are present within integrated
vector copies, helper virus (adenovirus or HSV) infection in vivo may mobilize the
integrated AAV-ITR vector cassette and affect stability of transgene expression.

To answer these questions, K562 cells can be infected with ΔAd.AAVSNori, AdSNori,
Ad.AAVSNoriITR, and rAAVSNori at MOIs of 2x10^5, 2x10^6, or 2x10^7 genomes per cell.
Infected cells are plated in 96 well plates in the presence or the absence of G418. The
latter is included because G418 may cause amplification of integrated vector DNA
(Rutledge, E. A. et al., 1997, Journal of Virology, 71:8429-36). Genomic DNA from
isolated clones can be analyzed by regular Southern Blot as described in the Examples
Section to confirm the presence of vector concatemers and calculate the number of
integrated vector copies. More informative is the sequencing of integrated vector copies
and their junctions with chromosomal DNA. The structure of integration junctions can
be used to delineate the role of AAV ITRs in vector integration and the extent of
insertional mutagenesis after transduction. This data provides information about the
potential risks of hybrid vectors used in clinical trials.

Transduction of cell cycle arrested cells: 

The ultimate targets for the hybrid vectors described in the Specification are quiescent
hematopoietic stem cells. We hypothesize that the double-stranded nature of ΔAd.AAV
genomes and specific nuclear import mechanisms may allow for the transduction of
non-dividing cells. This is in part supported by the transduction studies with ΔAd.AAV1 in
quiescent hepatocytes in vivo. To confirm this data, primary fibroblasts can be forced to
enter the G0 phase by serum/growth factor starvation before infection with the hybrid
vectors according to a protocol described by Russell (Russell, D. et al., 1995, PNAS,
92:5719-23). Cells are maintained for three days after infection under serum/growth
factor deprivation. At this time point, genomic DNA is isolated and analyzed for
integration events by PFGE in comparison with growing cells. Another series of
integration studies can be carried out on K562 cells arrested in the G1/S phase of the cell
cycle with aphidicolin (added 1 day before and maintained several days after infection
with hybrid vectors - depending on the integration kinetics studies described earlier). To
investigate whether DNA damaging agents increase the transduction frequency of hybrid
vectors, cell-cycle-arrested K562 cells or primary fibroblasts can be treated with
cisplatinum or 3H-thymidine prior to virus infection according to a protocol described by
Alexander and Russell (Alexander, I. E. et al., 1994, J. Virol, 68:8282-87; Russell, D. et
al., 1995, PNAS, 92:5719-23). Furthermore, the effect of chromosomal DNA
decondensation on the transduction efficiency of hybrid vectors can be studied in arrested
cells after treatment with puromycin, staurosporine, Hoechst 3328, distamycin, or
vanadate.

F. Improvements in ΔAd.AAV Production and Purification

To inhibit packaging of full-length genomes, a modified form of I-SceI, a yeast
mitochondrial intron-endonuclease with a non-palindromic 18-bp recognition sequence, is
expressed in 293 cells. Constitutive expression of this enzyme in mammalian cells is not
toxic, possibly due to either the lack of I-SceI sites in the genome or sufficient repair of
them (Rouet, P. et al., 1994, PNAS, 91:6064-8). The yeast I-SceI is modified with an
SV40 T-antigen nuclear localization signal and an optimal Kozak sequence to enhance its
functionality in mammalian cells (Rouet, P. et al., 1994, PNAS, 91:6064-8). For another
yeast endonuclease it was shown that a recognition site within a transduced Ad genome
was efficiently cleaved (30% of all transduced genomes) when the enzyme was expressed
in human A549 cells. Importantly, E4 ORF6 and ORF3 expressed from the transduced Ad
genome inhibited repair of the double-strand breaks introduced by the endonuclease
(Nicolas, A. L. et al., 2000, Virology, 266:211-24). This is consistent with the observations
by others that these E4 proteins prevent concatemerization of the viral genome (Boyer, J.
et al., 1999, Virology, 263:307-12). Based on this, packaging of full-length virus containing
an I-SceI recognition site is reduced in 293 cells constitutively expressing I-SceI. The
18-mer I-SceI site is inserted into the E3 region of the Ad.IR vectors. These vectors are
generated and amplified in 293 cells followed by a large-scale infection of 293 cells
expressing I-SceI. Alternatively, an expression cassette for the endonuclease XhoI is
inserted into the E3 region of Ad.IR or Ad.AAV vectors. The XhoI gene will be
modified for optimal function in mammalian cells. Vectors expressing XhoI are
generated and amplified in 293 cells expressing the XhoI isoschizomer PaeR7
methyltransferase (PMT) (Nelson, J. E. et al., 1997, J. Virol., 71:8902-7), which mediates
the addition of a methyl group onto the N6 position of the adenine base of XhoI sites,
CTCGAG. This protects the viral and cellular genome from XhoI cleavage. Methylated
Ad vectors are produced at high titers. ΔAd.AAV vectors are then obtained by large-
scale infection of 293 cells with the Ad.AAV-XhoI vectors. At this stage the viral
genome is not methylated and is digested at the XhoI sites. XhoI sites present within the
transgene cassette are deleted by site-directed mutagenesis without altering the amino
acid sequence. XhoI accumulates only at late stages of virus replication and should
act upon a large part of Ad DNA only when replication is completed. In addition,
ultracentrifugation optimizes the separation between Ad.IR and ΔAd.IR particles
(Balague, C. et al., 2000, Blood 95:820-8).

EXAMPLE II 

MODIFIED FIBER PROTEIN 

A. Test the Infectivity of Different Human or Animal Serotypes on Human Bone
Marrow Cells.

Since the amino acid sequence of the fiber knob region varies considerably among the
~50 known serotypes, it is thought that different adenovirus serotypes bind to different
cellular receptor proteins or use different entry mechanisms (Shenk, T., 1996, In B. N.
Fields, et al. (eds.), Fields Virology, vol. 2, Lippincott-Raven Publishers, Philadelphia;
Mathias, P. et al., 1994, Journal of Virology, 68:6811-14; Defer, C., et al., 1990, J. of
Virology, 64:3661-3673). Although most adenoviruses contain RGD motifs in the
penton base proteins, there are a number of serotypes (e.g. Ad40, 41) without this
conserved sequence. These types may use αv-integrin-independent pathways for virus
internalization (Davison, A.J., et al., 1993, J. Mol. Biol., 234:1308-1316; Mathias, P. et
al., 1994, Journal of Virology, 68:6811-14). To test whether other Ad serotypes can
infect stem cell subpopulations present in human bone marrow, studies with a series of
different human Ad serotypes and animal viruses can be performed (see Table II). As a
means to verify efficient transduction with Ad serotypes, viral DNA is tagged before
infection and the presence of viral genomes in the nuclei of transduced cells is
investigated. Furthermore, whether viral DNA is replicated in transduced cells can be
analyzed as indirect proof for early viral gene expression. Direct detection of expressed
viral proteins is not possible because antibodies against all the serotypes included in this
study are not available. Simultaneously with the infection assay, transduced
human bone marrow cells can be analyzed for morphological and immunohistochemical
features characteristic of HSC or progenitor subpopulations. For retargeting, serotypes
which are able to infect CD34+ subsets of bone marrow cells at the lowest MOI are
selected. As the next step, the fiber gene is PCR-cloned from serotypes with potential
HSC/CD34+ tropism and inserted into standard shuttle plasmids for Ad5 vector
generation, replacing the Ad5 fiber gene, using an E. coli recombination system (Figure
9).

METHODS 

Cells and viruses:

HeLa (human cervix carcinoma, ATCC CCL-2.2), CHO (Chinese hamster ovary, ATCC
CCL-61), K562 (human hematopoietic, ATCC 45506), HEp-2 (human larynx carcinoma,
ATCC CCL-23), and 293 (human embryonic kidney, Microbix, Toronto, Canada) cells were
maintained in DMEM, 10% FCS, 2 mM glutamine, and Pen/Strep. Culture medium for
CHO cells was supplemented with 200 µM asparagine and 200 µM proline. Human
CD34+-enriched bone marrow cells were purified from peripheral blood after
mobilization using MiniMACS VS+ separation columns (Miltenyi Biotec, Auburn, CA),
according to the manufacturer's instructions. Aliquots were stored in liquid nitrogen.
Sixteen hours before the experiment, cells were recovered from the frozen stock and
incubated overnight in IMDM medium supplemented with 20% FCS, 10^-4 M
β-mercaptoethanol, 100 µg/ml DNase I, 2 mM glutamine, 10 U/ml IL-3, and 50 ng/ml stem
cell factor (SCF) or 2 ng/ml thrombopoietin (Tpo). The purity of CD34+ preparations
was verified by flow cytometry and was consistently greater than 90%.

Flow cytometry: 

Adherent cells (CHO, HeLa) grown in non-tissue culture treated 10 cm dishes (Falcon,
Franklin Lakes, NJ) were detached by treatment with 1 mM EDTA and washed three
times with wash buffer (WB), consisting of PBS supplemented with 1% FCS. Cells
grown in suspension (K562, CD34+) were washed three times with WB. After washing,
cells were resuspended in WB at 2 x 10^6 cells/ml. Aliquots of 2 x 10^5 cells were
incubated in WB for 1 h at 37°C with monoclonal antibodies specific for αv-integrins
[L230, ATCC: HB-8448 (Rodriguez, E., Everitt, E. 1999. Arch. Virol. 144:787-795) (1/30
final dilution)], CAR [RmcB (Bergelson, J. M., et al. 1997. Science. 275:1320-1323; Hsu,
K.-H. L., et al. 1988. J. Virology. 62:1647-1652) (1/400 final dilution)], or BrdU
[(Amersham, Arlington Heights, IL) (1/100 final dilution)]. Subsequently, cells were
washed with WB and incubated with fluorescein isothiocyanate (FITC)-labeled horse
anti-mouse IgG antibodies [(Vector Labs., Burlingame, CA) (1/100 final dilution)] or
phycoerythrin (PE)-labeled goat anti-mouse IgG antibodies [(Calbiochem, La Jolla, CA)
(1/100 final dilution)] for 30 min at 4°C. After incubation with secondary antibodies,
cells were washed two times with WB and 10^4 cells per sample were analyzed in
duplicate by flow cytometry.

For the analysis of CD34 and c-kit expression on transduced CD34+ cells and for
fluorescence-activated cell sorting (FACS), purified human CD34+ cells were incubated
with phycoerythrin (PE)-conjugated anti-CD34 monoclonal antibodies (Becton-Dickinson
Immunocytochemistry Systems, San Jose, CA) or with PE-labeled anti-CD117 (c-kit)
monoclonal antibodies (MAb 95C3, Immunotech, Beckman Coulter, Marseille, France)
according to the manufacturer's protocol followed by flow cytometry analysis. All
analyses and sortings were performed on a FACStar Plus flow cytometer (Becton
Dickinson, Franklin Lakes, NJ) equipped with 488 nm argon and 633 nm HeNe lasers.
For analysis of c-kit expression and FACS purification of CD34+/c-kit+ cells, SCF was
not added to the media during culturing of CD34+ cells.

RESULTS 

CAR/αv-integrin expression on test cells:

It is generally accepted that CD34+ cells possess bone marrow repopulating activity.
Therefore, we used human CD34+ cells as the target for our studies towards identifying
Ad serotypes with HSC tropism and constructing new viral vectors. Studies were
performed on mobilized, CD34-positive, peripheral blood cells from one donor under
conditions which are known to retain CD34+ cells in a quiescent stage (Leitner, A., et al.
1996. Br. J. Haematol. 92:255-262; Roberts, A. W., Metcalf, D. 1995. Blood. 86:1600-
1605). More than 90% of purified cells were CD34 positive by flow cytometry.
Furthermore, we included into our Ad tropism studies the cell line K562, which is
considered to be an adequate model for studying gene transfer into human hematopoietic
cells (McGuckin, et al. 1996. British Journal of Haematology. 95:457-460). HeLa cells,
which are readily infectible by Ad5, and CHO cells, which are refractory to Ad5 infection
(Antoniou, M. et al., 1998, Nucleic Acid Res., 26:721-9), were used as positive and
negative control cell lines, respectively.

For Ad5, binding both to the primary receptor and to αvβ3 and αvβ5 integrins is
important for high efficiency infection of target cells. The expression of CAR and αv
integrins on test cells was analyzed by flow cytometry using monoclonal antibodies
against CAR (RmcB (Bergelson, J. M., et al. 1997. Science. 275:1320-1323; Hsu, K.-H.
L., et al. 1988. J. Virology. 62:1647-1652)) and αv integrins (L230 (Roelvink, P. W., et
al. 1996. J. Virology. 70:7614-7621)) (Figure 10). As expected, nearly all HeLa cells
expressed high levels of CAR and αv-integrins, whereas CHO cells lacked significant
CAR and αv-integrin expression. Fifteen percent and 77% of K562 cells expressed CAR
and αv-integrins, respectively. Only ~6% of the CD34+ cells used in our studies expressed
CAR and 17% were positive for αv-integrins. Notably, the preparation of CD34+ cells
represents a mixture of different cell types. The absent or low expression of primary and
secondary Ad5 receptors on non-cycling human CD34+ cells is in agreement with
previous reports (Huang, S., et al. 1996. J. Virology. 70:4502-4508; Neering, S. J., et al.
1996. Blood. 88:1147-1155; Tomko, R. P., et al. 1997. Proc. Natl. Acad. Sci. USA.
94:3352-3356).

Infection assay using wild-type Ad5 and K562 cells: 

The presence of viral DNA in the nucleus of infected cells is an indirect means to
demonstrate efficient virus binding, internalization, and nuclear import. Nuclear
localization of the viral genome is a prerequisite for transgene transcription and
integration. Two techniques are utilized to tag viral DNA for in situ analysis. To
optimize the infection assay, wild-type Ad5 virus and K562 cells, which are permissive
for Ad5 infection, can be used. The first protocol (Challberg, S.S. and Ketner, S. 1981,
Virology 114, 196-209) is based on 32P-labeling of viral DNA. During amplification of
wild-type Ad5 in A549 cells, 32P-phosphate (40 nCi/ml) is added to phosphate-free
medium. After development of CPE, 32P-tagged virus is harvested, banded in CsCl
gradients, and titered on HeLa cells according to standard protocols. To simulate the
conditions for infection of human bone marrow cells, K562 cells are incubated in
suspension with an MOI of 1, 10, or 100 of 32P-Ad5 for 2, 4, 6, or 8 hours under agitation
at 37°C. This covers the time period necessary for adsorption, internalization, and
nuclear import. After washing, cells are fixed and either transferred to microscopy slides
using cytospin or embedded in paraffin and sectioned (according to protocols from
Vector Labs, Burlingame, CA). The latter has the potential advantage that multiple
consecutive sections (5 µm) of the same cell can be analyzed by different methods (e.g.
for 32P-tagged viral DNA, for specific histological staining, for immunofluorescence),
which allows for correlating infection with a particular cell type present in the bone
marrow. Cells are incubated in a Kodak NTB-2 photo emulsion for autoradiography.
The exposure time can be optimized to minimize background or non-nuclear localized
signals. A dose- and time-dependent appearance of nuclear silver grains is expected under
the optimized conditions. Since 32P-phosphate can label viral proteins as well, a
cytoplasmic background signal might appear. To facilitate detection,
immunofluorescence with HSC-specific antibodies on sections can be performed. As an
alternative method, a BrdU-labeling technique for viral DNA can be used (Lieber, A., et
al. 1999. J. Virol. 73:9314-24; Lieber, A. et al., 1996, Journal of Virology, 70:8944-60).
In this case, different amounts of BrdU are added to the A549 culture medium during
wtAd5 virus propagation. BrdU-labeled viral DNA can be detected with monoclonal
antibodies specific to BrdU. The signal can be enhanced using layers of species-specific
polyclonal antibodies in combination with biotin/avidin and a fluorescent marker. BrdU-
tagged viral DNA can be detected on cytospins of bone marrow cells together with cell
surface markers by double or triple immunofluorescence.

DISCUSSION

The interaction of selected Ad serotypes with CD34+ cells was tested. As a result of this 
screening we constructed a first-generation, Ad5-based vector whose fiber was 
substituted with the fiber derived from Ad35. We demonstrated that this capsid 
modification allowed for efficient viral transduction of potential HSCs by the 
corresponding chimeric Ad vectors. 

All tropism and transduction studies were performed with non-cycling CD34+ cells,
which are thought to include HSCs. The quiescent stage of CD34+ cells purified from
mobilized blood is important because induction of cell proliferation is associated with a
loss of the ability to reconstitute hematopoiesis and with changes in the spectrum of
cellular receptors (Becker, P. S., et al. 1999. Exp. Hematol. 27:533-541). It is known that
treatment of hematopoietic cells with cytokines or growth factors changes the expression
of specific integrins including αv-integrins, which would ultimately alter the
susceptibility of cells to Ad infection or may affect the viability of infected cells (Gonzalez,
R., et al. 1999. Gene Therapy. 6:314-320; Huang, S., et al. 1995. J. Virology. 69:2257-
2263). Another fact that complicates the interpretation of transduction studies is the
extraordinary heterogeneity of CD34+ cells with regard to morphology and function.

B. Screening different adenoviruses to establish tropism to HSC. 

The ATCC provides more than 70 different human or animal adenoviruses (see Appendix
I). A collection of 15 human serotypes and 6 animal adenoviruses (see Table II) is
selected based on the following criteria: (i) availability of the complete genome sequence
or fiber sequence from the NIH gene bank, (ii) CAR receptor usage absent or unknown,
(iii) different subgroups, and (iv) moderate or low tumorigenicity (Shenk, T., 1996, In B.
N. Fields, et al. (eds.), Fields Virology, vol. 2, Lippincott-Raven Publishers, Philadelphia).
However, any serotype shown in the Appendix hereto can be used for the invention
described. Animal viruses are included in the infectivity assay because this may provide a
means to circumvent the pre-existing humoral immunity against human Ad5 fiber, which
represents a critical obstacle for clinical trials with Ad vectors.

METHODS 

Viruses:

The following human adenovirus serotypes were purchased from the ATCC: 3 (VR-3), 4
(VR1081), 5 (VR-5), 9 (VR1086), 35 (VR-716) and 41 (VR-930). Adenovirus No.
VR-716 was purchased from ATCC labeled as serotype 34; however, it was found to be
serotype 35 upon sequencing of the fiber region. For amplification, the corresponding
Ads were infected onto HeLa, 293, or HEp-2 cells under conditions that prevented
cross-contamination. Virus was banded in CsCl gradients, dialyzed and stored in aliquots as
described elsewhere (Lieber, A., et al. 1996. Journal of Virology. 70:8944-8960).
Plaque titering was performed as follows: Confluent 293 cells plated in 6-well plates
were incubated for 24 hours with virus in a total volume of 1 ml. Two weeks after
infection, plaques were counted on cultures overlaid with 1% agarose/MEM/10%
FCS.

EM studies:

CsCl-banded Ad stocks were thawed and diluted with 0.5% glutaraldehyde. Grids were
prepared as described earlier (Mittereder, N., et al. 1996. J. Virology. 70:7498-7509).
After staining with 2% methylamine tungstate (Nanoprobes, Stony Brook, NY), the
carbon-coated grids were evaluated and photomicrographed with a Philips 410 electron
microscope, operated at 80 kV (final magnification 85,000x). For each particular Ad
serotype, the number of morphologically deficient viral particles per 100 was counted in
five random fields.

RESULTS




Electron microscopy:

Little is known about the stability of particles from serotypes other than Ad5. Since the
intactness of viral particles was crucial for comparative interaction studies, virions from
the serotypes specified above were analyzed by electron microscopy (EM). EM studies of
negative contrast stained Ad suspensions demonstrated that the percentage of defective
particles (loss of icosahedral shape or luminal staining) did not exceed 5%, indicating that
serotype preparations had comparable qualities. Representative EM photographs are
shown for Ads 5, 9, and 35 (Figure 11).

Serotype screening: 

It is thought that different Ad serotypes bind to different cellular receptor proteins and
use different entry mechanisms (Defer, C., et al. 1990. J. Virology. 64:3661-3673;
Mathias, P., et al. 1994. Journal of Virology. 68:6811-6814). A set of human
adenoviruses was obtained from the ATCC to be tested for tropism to CD34+ cells.
These included serotypes 3, 4, 5, 9, 35, and 41, representing different subtypes (Table 1).
We believed that these serotypes would use different cellular attachment and
internalization strategies due to differing lengths of fiber shafts (Chroboczek, J., et al.
1995. Adenovirus fiber, p. 163-200. In W. Doerfler and P. Böhm (ed.), The molecular
repertoire of adenoviruses, vol. 1. Springer Verlag, Berlin; Roelvink, P. W., et al. 1998.
J. Virology. 72:7909-7915), the presence or absence of RGD motifs within the penton
base, and differing tissue tropism. The relatively little characterized Ad35 was selected
because it was found in immunocompromised hosts, particularly in bone marrow
recipients (Flomenberg, P., et al. 1994. Journal of Infectious Diseases. 169:775-781;
Flomenberg, P. R., et al. 1987. Journal of Infectious Diseases. 155:1127-1134; Shields,
A. F., et al. 1985. New England Journal of Medicine. 312:529-533). The latter
observations prompted us to believe that bone marrow cells are among the natural
reservoirs for Ad35.

TABLE II

Human and animal adenoviruses with potential interest for the invention

Adenovirus                            Serotype
Human/Group B                         3, 7, 11, 16, 21, 34, 35
Human/Group D                         8, 15, 17, 19, 28, 37
Human/Group F                         40, 41
Animal (including bovine and swine)   CELO, EDS, 3, 5, 4, 1

The underlined serotypes use CAR independent pathways for cell entry. 



For amplification, the corresponding adenovirus stocks can be infected onto HeLa or
A549 cells such that at a given time only one virus type is handled in a separate laminar
flow hood and cultured in HEPA-filtered bottles, preferentially in separate CO2 incubators,
to avoid cross-contamination. During propagation, viral DNA is tagged using one of the
techniques described earlier. Viral DNA can be isolated from purified particles. The
XhoI restriction pattern is analyzed for methylated and unmethylated viral DNA by
Southern blot using the full genome of the corresponding virus type as a radioactive
probe.

DISCUSSION 

Although it was reported earlier by slot-blot assay that fiber knobs derived from Ad2, 9, 4,
and 41L can bind to CAR (Roelvink, P. W., et al. 1998. J. Virology. 72:7909-7915), it is
not clear whether this binding occurs with an affinity that is physiologically relevant and
whether this would confer cell entry. Furthermore, as shown for the Ad5 interaction
between the penton and integrins, a secondary receptor is required to induce virus
internalization. We demonstrated that different serotypes interacted differently with the
K562 or CD34+ target cells. Ad5, Ad4, and Ad41 were not able to efficiently attach to
and be internalized by K562 and CD34+ cells. Although Ad4 belongs to a separate
subgroup (E), it is thought that Ad4 represents a natural hybrid between subgroup B and
C viruses with a fiber related to Ad5 (Gruber, W. C., et al. 1993. Virology. 196:603-611).
Therefore, it was not surprising that Ad4 has binding properties similar to Ad5. The
subgroup F serotype Ad41 has been shown to contain two distinct fibers, a long-shafted
and a short-shafted fiber, allowing for different cell entry pathways (Tiemessen, C. T., Kidd,
A.H. 1995. J. Gen. Virol. 76:481-497). The Ad41 penton base does not contain RGD
motifs, suggesting that this virus may use αv-integrin-independent pathways for cell
entry. However, these features did not improve interaction with CD34+ cells. Ad9, Ad3,
and Ad35 did interact with CD34+ cells more efficiently than Ad5. Out of all the
serotypes tested, Ad35 demonstrated the most efficient attachment and internalization
with K562 and CD34+ cells. Although the short-shafted Ad9 can bind to CAR, it
preferentially uses αv-integrins for cell entry (Roelvink, P. W., et al. 1996. J. Virology.
70:7614-7621). Therefore, the low level of αv-integrin expression on certain subsets of
CD34+ cells may account for the observed susceptibility to Ad9.

C. Attachment and Internalization of the Ad serotypes to K562 and CD34+ cells.

METHODS

Labeling of Ads with [3H]-methyl thymidine:

Serotypes were labeled with [3H]-methyl thymidine as described in detail elsewhere
(Roelvink, P. W., et al. 1996. J. Virology. 70:7614-7621). Briefly, 5 x 10^7 HeLa or 293
cells were grown in 175 sq. cm flasks with 15 ml DMEM/10% FCS and infected with
wild type adenovirus at an MOI of 50 or higher. Twelve hours post-infection, 1 mCi of
[3H]-methyl thymidine (Amersham, Arlington Heights, IL) was added to the media and
cells were further incubated at 37°C until complete CPE was observed. Then, cells were
harvested, pelleted, washed once with cold PBS, and resuspended in 5 ml PBS. Virus was
released from the cells by four freeze-thaw cycles. Cell debris was removed by
centrifugation and viral material was subjected to ultracentrifugation in CsCl gradients
and subsequent dialysis as previously described (Lieber, A., et al. 1996. Journal of
Virology. 70:8944-8960). Virus purification and dialysis removed unincorporated
radioactivity. Wild type Ad particle concentrations were determined
spectrophotometrically by measuring the OD260, utilizing the extinction coefficient for
wild-type Ad5, ε260 = 9.09 x 10^-13 OD ml cm virion^-1 (Maizel, J. V., et al. 1968. Virology.
36:115-125). The virion specific radioactivity was measured by a liquid scintillation
counter and was always in the range of 1 x 10^-5 to 1 x 10^-4 cpm per virion. For selected
variants, the fiber gene was PCR amplified and sequenced to ensure identity and the
absence of cross-contamination.
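
By way of illustration only, the particle quantification described above reduces to two divisions. The following Python sketch assumes a 1 cm path length; the function names, the dilution factor, and the example readings are hypothetical and are not taken from the specification.

```python
# Extinction coefficient for wild-type Ad5 cited above (Maizel et al. 1968):
# 9.09e-13 OD ml cm per virion.
EPSILON_260 = 9.09e-13


def particles_per_ml(od260, dilution_factor=1.0, path_cm=1.0):
    """Virion concentration (particles/ml) from an OD260 reading.

    dilution_factor and path_cm are assumptions for illustration; the text
    states only the extinction coefficient.
    """
    if od260 < 0:
        raise ValueError("OD260 must be non-negative")
    return od260 * dilution_factor / (EPSILON_260 * path_cm)


def specific_activity(cpm_per_ml, virions_per_ml):
    """Virion-specific radioactivity in cpm per virion."""
    if virions_per_ml <= 0:
        raise ValueError("virion concentration must be positive")
    return cpm_per_ml / virions_per_ml


if __name__ == "__main__":
    # Hypothetical readings: OD260 of 0.25 on a 1:20 dilution, 2.75e8 cpm/ml.
    conc = particles_per_ml(od260=0.25, dilution_factor=20)
    activity = specific_activity(cpm_per_ml=2.75e8, virions_per_ml=conc)
    print(f"{conc:.2e} virions/ml, {activity:.1e} cpm/virion")
```

With this coefficient, one OD260 unit corresponds to roughly 1.1 x 10^12 particles per ml, and the hypothetical reading falls within the stated range of 1 x 10^-5 to 1 x 10^-4 cpm per virion.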

Viral DNA tagged with methylase and test for replication by genomic Southern blots:

To ultimately confirm transduction, a protocol to detect adenoviral replication in infected
cells can be established. Viral DNA synthesis can only occur after de novo expression of
adenoviral early genes. A site-specific methylation strategy is utilized to monitor viral
DNA replication within infected cells (Nelson, J. et al., 1997, Journal of Virology,
71:8902-07). Methylation-marked adenovirus can be produced by the addition of a
methyl group onto the N6 position of the adenine base of XhoI sites, CTCGAG, by
propagation of the virus in HeLa or A549 cells expressing the XhoI isoschizomer PaeR7
methyltransferase (PMT) (Kwoh, T.J., et al., 1986, Proc. Natl. Acad. Sci. USA 83, 7713-
7717). It is known that methylation does not affect vector production but does prevent
cleavage by XhoI. Loss of methylation through viral replication restores XhoI cleavage
and can be detected by Southern blots of genomic DNA from infected cells in
comparison to native, non-methylated, viral genomes.
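
By way of illustration only, the expected difference between methylated and replicated (unmethylated) viral DNA in an XhoI digest can be sketched in Python. The sketch assumes linear DNA, complete digestion, and all-or-none methylation, which are simplifications; the function names and the test sequence are hypothetical. XhoI cuts after the first C of CTCGAG, and because the site is palindromic only one strand needs to be searched.

```python
XHOI_SITE = "CTCGAG"   # XhoI recognition sequence given in the text
CUT_OFFSET = 1         # XhoI cuts C^TCGAG, one base into the site


def find_xhoi_sites(sequence):
    """Return the 0-based start positions of every XhoI site in a sequence."""
    seq = sequence.upper()
    positions = []
    start = seq.find(XHOI_SITE)
    while start != -1:
        positions.append(start)
        start = seq.find(XHOI_SITE, start + 1)
    return positions


def digest_fragment_lengths(sequence, methylated=False):
    """Fragment lengths after complete XhoI digestion of linear DNA.

    Assumption for illustration: a methylated genome is fully protected,
    so it remains a single uncut fragment.
    """
    if methylated:
        return [len(sequence)]
    cuts = [pos + CUT_OFFSET for pos in find_xhoi_sites(sequence)]
    bounds = [0] + cuts + [len(sequence)]
    return [end - begin for begin, end in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    toy_genome = "AAACTCGAGTTTTCTCGAGGG"  # hypothetical 21-nt test sequence
    print(find_xhoi_sites(toy_genome))
    print(digest_fragment_lengths(toy_genome, methylated=True))
    print(digest_fragment_lengths(toy_genome, methylated=False))
```

In a Southern blot, the appearance of the smaller fragments for input virus that was originally methylated would indicate that replication has occurred.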

Attachment and internalization assays: 

These studies were performed based on a protocol published elsewhere (Wickham, T. J.,
et al. 1993. Cell. 73:309-319). In preliminary experiments, we found that labeled Ad5
virions reached equilibrium in attachment to HeLa cells after 45 min at 4°C with an MOI
of 400 pfu per cell. For attachment studies, 3.5 x 10^5 cells were incubated for one hour on
ice with equal amounts of [3H]-labeled adenovirus OD particles equivalent to an MOI of
400 pfu/cell for Ad5 in 100 µl of ice-cold adhesion buffer (Dulbecco's modified Eagle's
medium supplemented with 2 mM MgCl2, 1% BSA, and 20 mM HEPES). Next, the cells
were pelleted by centrifugation for 4 min at 1000 x g and washed two times with 0.5 ml
ice-cold PBS. After the last wash, the cells were pelleted at 1500 x g, the supernatant was
removed, and the cell-associated radioactivity was determined by a scintillation counter.
The number of viral particles bound per cell was calculated using the virion specific
radioactivity and the number of cells. To determine the fraction of internalized [3H]-
labeled adenoviral particles, cells were incubated on ice for one hour with the
corresponding virus, washed with PBS as described above, resuspended in 100 µl
adhesion buffer, and then incubated at 37°C for 30 min. Following this incubation, cells
were diluted 3-fold with cold 0.05% trypsin-0.5 mM EDTA solution and incubated at
37°C for an additional 5-10 min. This treatment removed 99% of attached radioactivity.
Finally, the cells were pelleted at 1500 x g for 5 min, the supernatant was removed, and
the protease-resistant counts per minute were measured. This protocol minimizes the
possibility that the internalization data were affected by receptor recycling (Rodriguez,
E., Everitt, E. 1999. Arch. Virol. 144:787-795). Nonspecific binding of Ad particles to
cells on ice was determined in the presence of 100-fold excess of unlabeled virus. This
value routinely represented less than 0.1% of viral load.
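
By way of illustration only, the calculation of viral particles per cell from cell-associated counts can be sketched in Python as follows. The function names and the example counts are hypothetical, and the optional subtraction of nonspecific counts is an assumption; the text states only that nonspecific binding was measured.

```python
def particles_per_cell(sample_cpm, cpm_per_virion, n_cells, nonspecific_cpm=0.0):
    """Viral particles per cell from cell-associated radioactivity.

    sample_cpm      -- cell-associated (or protease-resistant) counts per minute
    cpm_per_virion  -- virion-specific radioactivity of the labeled stock
    n_cells         -- number of cells in the sample
    nonspecific_cpm -- optional background to subtract (an assumption here)
    """
    if cpm_per_virion <= 0 or n_cells <= 0:
        raise ValueError("cpm_per_virion and n_cells must be positive")
    specific_cpm = max(sample_cpm - nonspecific_cpm, 0.0)
    return specific_cpm / cpm_per_virion / n_cells


def internalized_fraction(attached_per_cell, internalized_per_cell):
    """Fraction of attached particles that became protease resistant."""
    if attached_per_cell <= 0:
        raise ValueError("attached_per_cell must be positive")
    return internalized_per_cell / attached_per_cell


if __name__ == "__main__":
    # Hypothetical counts for 3.5e5 cells and a stock at 5e-5 cpm per virion.
    attached = particles_per_cell(9625, 5e-5, 3.5e5)
    internalized = particles_per_cell(8750, 5e-5, 3.5e5)
    print(round(attached), round(internalized))
    print(f"{internalized_fraction(attached, internalized):.2f}")
```

The same function serves for both the attachment samples (cell-associated counts after incubation on ice) and the internalization samples (protease-resistant counts after the 37°C step).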

RESULTS 

Attachment of Ad particles to target cells and internalization:

The selected serotypes were metabolically labeled with [3H]-thymidine, which is
incorporated into viral DNA during replication. Adsorption and internalization can be
experimentally dissociated by taking advantage of the observation that at low temperature
(0-4°C) only virus cell attachment occurs, whereas internalization requires incubation at
higher temperatures. The number of particles adsorbed or internalized per cell was
calculated using the virion-specific radioactivity and used to quantify interaction of Ads
3, 4, 5, 9, 35, and 41 with CD34+, K562, HeLa and CHO cells (Figure 12). The serotypes
varied significantly in their ability to attach to and to be internalized by the different cell
lines. For Ad5, the degree of attachment to the cell lines tested correlated with the level
of CAR expression. In CHO cells, which were previously shown to be refractory to Ad5
infection, the level of attachment and internalization was about 50-70 viral particles per
cell. This number was hereafter assumed negative in terms of susceptibility of a given
cell type for Ad5. Interaction of the other serotypes with CHO cells was not significantly
higher, indicating that corresponding receptors were absent on CHO cells. All serotypes
tested interacted with HeLa cells, with Ad3 and Ad35 being the most efficient variants.
The presence of distinct Ad3 and Ad5 receptors on HeLa cells was demonstrated
previously (Stevenson, S. C., et al. 1995. J. Virology. 69:2850-2857). Ads 4, 5, and 41
did not bind to K562 cells. In contrast, Ad9 as well as the members of subgroup B, Ad3
and Ad35, efficiently interacted with K562 cells, with Ad35 having the highest number of
adsorbed and internalized particles. Compared to Ad5, about 25 times more Ad35
particles were attached and three-fourths of these were internalized by K562 cells. Viral
interactions with CD34+ cells were generally weaker. Among the serotypes tested, only
Ad9 and Ad35 were significantly internalized by non-cycling CD34+ cells.
Internalization of Ad9 and Ad35 was, respectively, four and eight times more efficient
than for Ad5 particles. The number of Ad35 virions internalized by CD34+ cells was
almost half of that seen for Ad5 in HeLa cells, which can be readily infected with Ad5
based vectors.

Attachment and internalization of adenovirus serotypes 3. 5. P. 35 and 41 into HeLa* 293 \ 
20 and CHO cells: 

HeLa and 293 cells, which express high levels of primary and secondary receptors for human
adenoviruses, are used as a positive control for virus attachment and internalization.
CHO cells are used as a negative control. CHO cells do not express the primary adenoviral
receptor at a detectable level and are therefore refractory to adenoviral infection. For
attachment studies, these adherent cell lines are detached from 10 cm dishes with
PBS-EDTA solution (without Ca2+ and Mg2+), washed three times with ice-cold PBS,
resuspended in adhesion buffer, and incubated with viruses as described above in the
Examples section. As expected, all adenoviral serotypes tested attach efficiently to
and are internalized into HeLa cells (Table III) (Figure 13). Adenovirus serotypes 3, 5,
35 and 41, but not 9, attach efficiently to and are internalized by 293 cells. In contrast,
poor attachment and internalization of most adenovirus serotypes are observed with CHO
cells. The level of attachment on CHO cells is about 50-70 virus particles per cell for
adenovirus serotypes 5 and 41, 115 virus particles per cell for adenovirus type 3 and
about 180 particles per cell for adenovirus serotypes 9 and 35. For further analysis,
numbers >300 viral particles per cell are assumed as positive and <70 viral particles per
cell as negative in terms of susceptibility of a particular cell line to efficient
adenoviral transduction.
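
The positive and negative cutoffs stated above can be written as a simple classification
rule. The following Python sketch is illustrative only: the function name and the
"intermediate" label for values between the two cutoffs are assumptions, since the text
does not name that range.

```python
def classify_susceptibility(particles_per_cell: float,
                            positive_above: float = 300.0,
                            negative_below: float = 70.0) -> str:
    """Classify a cell line by the number of viral particles per cell.

    Cutoffs follow the text: >300 is positive, <70 is negative.
    """
    if particles_per_cell > positive_above:
        return "positive"
    if particles_per_cell < negative_below:
        return "negative"
    # The text leaves this range unnamed; "intermediate" is an assumed label.
    return "intermediate"


if __name__ == "__main__":
    # Approximate K562 attachment values reported in this example section.
    for serotype, count in [("Ad5", 60), ("Ad9", 320), ("Ad35", 1500)]:
        print(serotype, classify_susceptibility(count))
```

Values between the two cutoffs are deliberately not forced into either class, which
mirrors the gap the text leaves between 70 and 300 particles per cell.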



TABLE III

Comparative analysis of attachment and internalization of Ad5 and Ad9
to cell lines expressing different amounts of CAR and αv-integrins.

Cell line      CAR          αv-integrin   Ad9 (attached/   Ad5 (attached/
               expression   expression    internalized)    internalized)

HeLa           ++           ++            426/370          550/500
CHO                         ++            300/300          70/50
293            ++           ++            20/20            1950/1750
Y79            +++                        190/140          1200/1100
K562                        +             320/230          60/50
Erythrocytes   ?            ?             420/-            68/-
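
Each Ad9 and Ad5 entry in Table III is an attached/internalized pair of particle counts
per cell. As a reading aid, the Python sketch below computes the internalized fraction
from such an entry. The function name is hypothetical, and entries with no
internalization value (written as "-") return None.

```python
from typing import Optional


def internalized_fraction(entry: str) -> Optional[float]:
    """Return internalized/attached for an 'attached/internalized' table entry."""
    attached_text, internalized_text = entry.split("/")
    if internalized_text.strip() == "-":
        # No internalization value was reported (e.g. erythrocytes).
        return None
    attached = float(attached_text)
    internalized = float(internalized_text)
    return internalized / attached


if __name__ == "__main__":
    # Entries taken from the HeLa and erythrocyte rows of Table III.
    for label, entry in [("HeLa Ad9", "426/370"), ("HeLa Ad5", "550/500"),
                         ("Erythrocytes Ad9", "420/-")]:
        print(label, internalized_fraction(entry))
```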



Attachment and internalization of adenovirus serotypes 3, 5, 9, 35 and 41 into human
CD34+ bone marrow cells and K562 erythroleukemia cell line:

Previous studies showed that the human erythroleukemia cell line K562 can be
transduced with Ad5-based adenoviral vectors at very high MOIs. As shown in Figure
14, only about 60 viral particles per cell of adenovirus serotype 5 are attached to, and
even fewer particles are internalized into, these cells at an MOI of 400. In contrast to
Ad5, about 320 viral particles per cell of Ad9 and about 1500 viral particles per cell of
Ad35 are attached to K562 cells, and about two-thirds of them are internalized (Figure 14B).
Human unstimulated CD34+-enriched bone marrow cells obtained from frozen stocks are
incubated overnight in growth medium without cytokine stimulation. The next day, the
number of viable cells is calculated. For attachment studies, cells are washed three times
with ice-cold PBS, resuspended in adhesion buffer and incubated with adenoviruses.
Among the adenoviral serotypes tested, only particles of Ad9 (about 150 viral
particles per cell) and Ad35 (about 320 viral particles per cell) are able to attach to
unstimulated CD34+ cells at a level higher than that of Ad5 (only 60 viral particles per
cell). Four-fifths of these virus particles are internalized by the cells.

Interestingly, upon stimulation of CD34+ cells with GM-CSF and EPO/TPO for two
weeks, attachment and internalization of Ad9 viral particles are significantly increased
(up to 300 particles per cell). In contrast, transient stimulation of cells with
GM-CSF for two days does not increase the level of viral attachment to the cells.

Based on the above finding that the Ad35 serotype is able to attach to and internalize into
CD34+ cells most efficiently among several serotypes tested, serotype Ad35 was selected
for further studies. As described in Appendix II, a chimeric vector (Ad5GFP/F35)
containing the short-shafted Ad35 fiber sequence in an Ad5 capsid was able to target a
broad spectrum of CD34+ cells in a CAR/integrin-independent manner.


DISCUSSION 

In summary, of all the serotypes tested, Ad9, Ad3 and Ad35 demonstrated the most
efficient attachment to and internalization into K562 and CD34+ cells. Based on
adsorption/internalization data, Ad9 and Ad35, as representatives of subgroups D and B,
were selected for further tropism studies.

D. Characterization of Ad vector replication in K562 and CD34+ cells. 

Comparative analysis of the ability of Ad5, Ad9 and Ad35 to infect and to replicate in 293,
K562 and CD34+ cells. The ability of the Ad9 fiber knob domain to recognize the same
primary receptor on the cell surface as Ad5 with comparable affinity was described
earlier. Thus, the finding that Ad9 viral particles attach only poorly to 293 cells is
rather unexpected. In order to find out how the attachment and internalization data
reflect the biological activity of adenoviruses of different serotypes, the stocks of Ad5,
Ad9 and Ad35 are characterized in more detail by electron microscopy, plaque assay on
293 cells, and quantitative replication assay in K562 and CD34+ cells.

METHODS 
Quantitative replication assay:

1 x 10^5 CD34+ or K562 cells were infected in 100 µl of growth media with different MOIs
of Ad5, 9, or 35 which had been amplified in 293 cells expressing the XhoI DNA
methyltransferase isoschizomer PaeR7 (Nelson, J., Kay, M.A. 1997. Journal of Virology.
71:8902-8907). After 2 hours of incubation at 37°C, the cells were centrifuged at 1000 x
g for 5 min, the virus-containing medium was removed, the cells were resuspended in
100 µl of fresh media, and then they were incubated at 37°C until harvesting. At 16 hours
post-infection for K562 cells, or 36 h post-infection for CD34+ cells, 5 µg of pBS
(Stratagene, La Jolla, CA) plasmid DNA was added as a carrier which could also be used
as a loading control. Genomic DNA was extracted as described previously (Lieber, A.,
C.-Y. et al. 1996. Journal of Virology. 70:8944-8960). One-fourth of purified cellular
DNA (equivalent to 2.5 x 10^4 cells) was digested with HindIII, XhoI, or with HindIII and
XhoI together at 37°C overnight and subsequently separated in a 1% agarose gel followed
by Southern blot with chimeric Ad5/9 or Ad5/35 DNA probes. The chimeric probes,
containing sequences of Ad5 and Ad9 (Ad5/9) or Ad5 and Ad35 (Ad5/35), were
generated by a two-step PCR amplification using Pfu-Turbo DNA polymerase
(Stratagene, La Jolla, CA) and viral DNA from purified particles as a template. The
following primers were used for PCR (Ad5 sequences and nucleotide numbers are
underlined):

Ad5F1 (nt: 32775-32805) 5'-GCC CAA GAA TAA AGA ATC GTT TGT GTT ATG-3';
Ad5R1 (nt: 33651-33621) 5'-AGC TGG TCT AGA ATG GTG GTG GAT GGC GCC A-3';
chimeric Ad5/9F (nt: 31150-31177, nt: 181-208) 5'-AAT GGG TTT CAA GAG AGT CCC CCT
GGA GTC CTG TCA CTC AAA CTA GCT GAC CCA-3';
chimeric Ad5/9R (nt: 32805-32775, nt: 1149-1113) 5'-CAT AAC ACA AAC GAT TCT TTA TTC
TTG GGC TTC ATT CTT GGG CGA TAT AGG AAA AGG-3';
chimeric Ad5/35F (nt: 31150-31177, nt: 132-159) 5'-AAT GGG TTT CAA GAG AGT CCC CCT
GGA GTT CTT ACT TTA AAA TGT TTA ACC CCA-3';
chimeric Ad5/35R (nt: 32805-32775, nt: 991-958) 5'-CAT AAC ACA AAC GAT TCT TTA TTC
TTG GGC ATT TTA GTT GTC GTC TTC TGT AAT GTA AG-3'.

Nucleotide numbers are given according to the sequences obtained from the NCBI
GenBank (accession No. M73260/M29978 for Ad5, X74659 for Ad9, and U10272 for
Ad35). After the first amplification, the 968 bp-long Ad9 and 859 bp-long Ad35 DNA
fragments corresponding to the fiber genes, and an 876 bp-long Ad5 fragment
corresponding to the Ad5 E4 region (located immediately downstream of the Ad5 fiber gene),
were purified by agarose gel electrophoresis. To generate chimeric DNA probes,
amplified Ad5 DNA was mixed with Ad9 or Ad35 fragments obtained during the first
step of PCR and subjected to a second PCR amplification using Ad5/9F or Ad5/35F
primers and the Ad5R1 primer. The resulting Ad5/9 or Ad5/35 chimeric DNA fragments
(see Fig. 15) were purified and their concentrations were measured
spectrophotometrically. Corresponding chimeric DNA fragments were loaded as
concentration standards on agarose gels or labeled with [32P]-dCTP and used as probes
for Southern analysis. The number of viral genomes per DNA sample was calculated
after quantitative Phospho-imager analysis. In preliminary experiments, no preferential
hybridization of chimeric DNA probes to DNA of any particular viral serotype was
detected.
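
The two-step PCR depends on the chimeric reverse primers carrying, at their 5' ends, the
reverse complement of the Ad5 forward primer, so that the first-round fiber fragment and
the Ad5 E4 fragment overlap and can be joined in the second round. The Python sketch
below checks that relationship using the primer sequences listed in this section. The
helper names are illustrative and are not part of the described protocol.

```python
# Translation table for complementing DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(primer: str) -> str:
    """Remove the triplet spacing used in the text and upper-case the sequence."""
    return primer.replace(" ", "").upper()


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


# Primer sequences as printed in the text (5' to 3').
AD5_FORWARD = clean("GCC CAA GAA TAA AGA ATC GTT TGT GTT ATG")
AD5_9_REVERSE = clean(
    "CAT AAC ACA AAC GAT TCT TTA TTC TTG GGC TTC ATT CTT GGG CGA TAT AGG AAA AGG")
AD5_35_REVERSE = clean(
    "CAT AAC ACA AAC GAT TCT TTA TTC TTG GGC ATT TTA GTT GTC GTC TTC TGT AAT GTA AG")


def shares_ad5_overlap(chimeric_reverse_primer: str) -> bool:
    """True if the primer's 5' end is the reverse complement of the Ad5 forward primer."""
    return chimeric_reverse_primer.startswith(reverse_complement(AD5_FORWARD))


if __name__ == "__main__":
    print("Ad5/9R overlaps Ad5 forward primer:", shares_ad5_overlap(AD5_9_REVERSE))
    print("Ad5/35R overlaps Ad5 forward primer:", shares_ad5_overlap(AD5_35_REVERSE))
```

The shared 30-nucleotide stretch is what lets the two first-round products prime each
other in the second amplification.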

RESULTS

Replication of selected serotypes in K562 and CD34+ cells:

Adsorption/internalization studies do not ultimately prove viral transduction, a process
often defined as gene transfer that allows for viral or heterologous gene expression in
host cells. Intracellular trafficking, including endosomal lysis, transport to the nucleus,
and nuclear import of the viral genome, depends on structural capsid proteins and thus
varies between different serotypes (Defer, C., et al. 1990. J. Virology, 64:3661-3673;
Miyazawa, et al. 1999. J. Virology, 73:6056-6065). We believed that analysis of viral
gene expression would be a means to verify successful nuclear import of viral genomes
and that this would be a good criterion for selection of serotype/s able to efficiently infect
our target cells. To do this, we used a protocol that allows for the detection of Ad
replication in infected cells. Viral DNA synthesis can only occur after de novo expression
of adenoviral early genes. We utilized a site-specific methylation strategy to monitor viral
DNA replication within infected cells (Nelson, J., Kay, M.A. 1997. Journal of Virology,
71:8902-8907). Methylated Ad serotypes were produced by the addition of a methyl
group onto the N6 position of the adenine base of Xho I sites, CTCGAG, during
propagation of the viruses in 293 cells expressing the Xho I isoschizomer PaeR7
methyltransferase (PMT) (Kwoh, T. J., et al. 1986. Proc. Natl. Acad. Sci. USA, 83:7713-
7717) (293 PMT cells). Loss of methylation through viral replication restores Xho I
cleavage and can be detected by Southern blots of Xho I-digested genomic DNA from
infected cells.
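
The methylation strategy relies on Xho I cutting only unmethylated CTCGAG sites. As a
toy illustration (not the assay itself), the Python sketch below locates Xho I sites in a
linear sequence and returns the fragment lengths expected when the sites are cleavable
versus protected by methylation. The example sequence and the function names are
invented for illustration; the cut position after the first C (C^TCGAG) is the standard
Xho I cleavage position.

```python
XHOI_SITE = "CTCGAG"
CUT_OFFSET = 1  # Xho I cuts after the first C of the site: C^TCGAG


def xhoi_cut_positions(seq: str) -> list:
    """Return 0-based cut positions for every Xho I site in a linear sequence."""
    seq = seq.upper()
    positions = []
    start = seq.find(XHOI_SITE)
    while start != -1:
        positions.append(start + CUT_OFFSET)
        start = seq.find(XHOI_SITE, start + 1)
    return positions


def fragment_lengths(seq: str, methylated: bool) -> list:
    """Fragment lengths after Xho I digestion of a linear molecule.

    Methylated (input) DNA is treated as fully protected, so it stays intact.
    """
    if methylated:
        return [len(seq)]
    cuts = [0] + xhoi_cut_positions(seq) + [len(seq)]
    return [end - begin for begin, end in zip(cuts, cuts[1:])]


if __name__ == "__main__":
    # Invented toy sequence with two Xho I sites.
    toy = "AAAA" + "CTCGAG" + "TTTTTT" + "CTCGAG" + "GG"
    print("methylated input DNA:", fragment_lengths(toy, methylated=True))
    print("replicated (unmethylated) DNA:", fragment_lengths(toy, methylated=False))
```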

Ad replication studies were performed in K562 and CD34+ cells with Ad9 and Ad35, in
comparison to Ad5. For replication studies, the infectious titer (in pfu/ml) and genome
titer (in genomes per ml) were determined (by plaque assay on 293 cells or by
quantitative Southern blot, respectively) for methylated and unmethylated Ad5, Ad9, and
Ad35 (Table 2). The ratio of pfu to genome titer was comparable for methylated and
unmethylated virus, demonstrating that DNA methylation had not altered transduction
properties. About 85% of the (Ad5, 9, and 35) virus used for infection was methylated, as
calculated based on the intensity of fragments specific for methylated and non-
methylated viral DNA present in the viral load (Fig. 15). The numbers of genomes
detected after adsorption (1 hour, 0°C) or internalization (2 hours, 37°C) correlated well
with studies shown in Fig. 12. Ad9 and Ad35 interacted more efficiently than Ad5 with
K562 and CD34+ cells. Dose-dependent replication studies in K562 and CD34+ cells
were performed with the same genome numbers of Ad5, 9, and 35 (Fig. 15). The
replication rate was measured based on the ratio of methylated to demethylated viral
DNA after infection with different MOIs (2100, 420, and 105 genomes per cell). In K562
cells, efficient replication (100% conversion from methylated to unmethylated DNA) was
detected for Ad5 at MOI ≥ 2100, for Ad9 at MOI ≥ 420, and for Ad35 at MOI
≥ 105. This demonstrated that Ad35 transduced K562 cells with the highest efficiency.
In CD34+ cells, the replication rate was 100% for Ad5 and 31% for Ad9 after infection
with MOI 420. Although methylated Ad35 viral DNA was present in CD34+ cells, viral
replication was undetectable for Ad35. In summary, while viral replication studies in
K562 cells confirmed data obtained for Ad5, 9, and 35 adsorption and internalization,
there was a discrepancy between earlier results and the poor replication of Ad9 and,
particularly, Ad35 in CD34+ cells. As outlined later, replication analysis in
heterogeneous cell populations, like CD34+ cells, may not allow for definitive
conclusions on the tropism of a particular serotype.
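
The replication rate described above is derived from the relative intensities of the
Southern blot bands specific for methylated (input) and unmethylated (newly replicated)
viral DNA. A minimal Python sketch of that calculation follows. The band intensities
are made-up numbers, the function names are hypothetical, and no correction is applied
for the roughly 15% of input virus that was already unmethylated.

```python
def percent_unmethylated(methylated_signal: float,
                         unmethylated_signal: float) -> float:
    """Percentage of viral DNA signal found in the unmethylated (cleavable) band."""
    total = methylated_signal + unmethylated_signal
    if total <= 0:
        raise ValueError("total band intensity must be positive")
    return 100.0 * unmethylated_signal / total


def genomes_for_moi(moi_genomes_per_cell: float, cell_number: float) -> float:
    """Number of viral genomes needed to infect cell_number cells at a given MOI."""
    return moi_genomes_per_cell * cell_number


if __name__ == "__main__":
    # Hypothetical band intensities in arbitrary Phospho-imager units.
    print(percent_unmethylated(methylated_signal=69.0, unmethylated_signal=31.0))
    # Genomes per well for the three MOIs in the text, using 1 x 10^5 cells.
    for moi in (2100, 420, 105):
        print(moi, genomes_for_moi(moi, 1e5))
```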

Taking all the screening data together, Ad9 and Ad35 emerged as the variants with the
strongest tropism for K562 and CD34+ cells. It is thought that Ad9 can bind to CAR;
however, it preferentially uses αv-integrins for cell entry (Roelvink, P. W., et al. 1996. J.
Virology. 70:7614-7621). This entry strategy may not be optimal for efficient infection of
CD34+ cells, as fewer than 17% of them express αv-integrins (Fig. 10). Therefore, we
decided to concentrate on Ad35 as a source of heterologous fiber to be used for
construction of a chimeric vector based on an Ad5 backbone.

TABLE IV

Results from the infectivity assay which determines the optical
particle-to-PFU (OPU/PFU) ratio using 293 cells

Virus   OPU (A260)       PFU              OPU/PFU ratio

Ad5     1.4 x 10^12      1.06 x 10^11     13
Ad9     4.61 x 10"       2.6 x 10"        1773
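
The OPU/PFU ratio in Table IV is the optical particle titer divided by the plaque-forming
titer. The short Python sketch below reproduces the Ad5 value, taking the Ad5 titers as
1.4 x 10^12 OPU and 1.06 x 10^11 PFU, which is the reading consistent with the reported
ratio of 13. The function name is illustrative.

```python
def opu_to_pfu_ratio(opu_titer: float, pfu_titer: float) -> float:
    """Optical particle units per plaque-forming unit."""
    if pfu_titer <= 0:
        raise ValueError("PFU titer must be positive")
    return opu_titer / pfu_titer


if __name__ == "__main__":
    ad5_ratio = opu_to_pfu_ratio(1.4e12, 1.06e11)
    print(f"Ad5 OPU/PFU ratio: {ad5_ratio:.1f}")
```

A higher ratio means that fewer of the physical particles in a preparation form plaques
on 293 cells.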



DISCUSSION

Viral replication studies in K562 cells confirmed the data obtained for Ad5, 9, and 35
adsorption and internalization. However, there was a discrepancy between the interaction
data and the replication data in CD34+ cells, where Ad9 replicated only poorly and no
replication was seen for Ad35. Ad replication is only initiated upon the production of a
critical threshold of early viral proteins, which, in turn, is directly dependent on the
number of viral genomes present in the nuclei of infected cells. Therefore, the outcome of
replication studies may be affected by the rate of nuclear import of viral genomes, by the
activity of viral promoters, and/or by the intracellular stability of viral DNA/RNA. These
parameters may vary, on the one hand, between different subsets of CD34+ cells and/or,
on the other hand, between different Ad serotypes. In conclusion, the viral replication
analyses performed with different Ad serotypes in CD34+ cells may not predict the actual
transduction properties of chimeric vectors based on an Ad5 backbone. This implies that
attempts to produce gene transfer vectors based on Ad genomes other than Ad5 should be
undertaken with caution.


Recently, an Ad serotype screening strategy was used to identify variants with tropism
for primary fetal rat CNS cortex cells or human umbilical vein endothelial cells. The
optimal serotype (Ad17) was selected based on immunohistochemistry for hexon
production 48 hours after infection (Chillon, M., et al. 1999. J. Virology. 73:2537-2540).
However, this approach is problematic because, at least in our hands, antibodies
developed against Ad5 hexon did not cross-react with other serotypes. Also, hexon is
expressed only after onset of replication. As outlined above, the kinetics of intracellular
trafficking, viral gene expression, and replication vary significantly between serotypes
(Defer, C., et al. 1990. J. Virology. 64:3661-3673; Miyazawa, et al. 1999. J. Virology.
73:6056-6065).

In addition to being the most efficient serotype in terms of interaction with CD34+ cells,
Ad35 is also interesting because it interacts with receptor/s different from those of Ad5
and Ad3. Ad35 and Ad5GFP/F35 attachment was not inhibited by Ad5 or anti-CAR
antibodies, suggesting that Ad35 binding was CAR independent. First, Ad5 did not
compete with Ad35 and Ad5GFP/F35 during internalization and infection, indicating that
αvβ3/5 integrins are not involved in viral entry. Second, function-blocking antibodies
against αv-integrins did not compete with Ad35 and Ad5GFP/F35 for internalization into
K562 cells, whereas these antibodies did inhibit Ad5 internalization. And third, in
contrast to Ad5-based vectors, GFP expression after infection with Ad5GFP/F35 was not
restricted to αv-integrin-expressing CD34+ cells. From these facts, we conclude that
infection with Ad35 and the chimeric Ad5GFP/F35 vector does not involve αv-integrins.
In this context, the presence or absence of RGD motifs within the Ad35 penton base remains
to be determined by sequencing the corresponding genome region. Cross-competition
assays demonstrated that Ad35 and Ad5GFP/F35 bind to a receptor that is different from
the Ad3 receptor. Although Ad3 and Ad35 belong to the same subgroup, they have been
divided into two DNA homology clusters, B1 and B2; the amino acids composing their
fibers are only 60% homologous. Furthermore, the target tissues for both viruses are
different; Ad3 can cause acute respiratory infections, whereas Ad35 is associated with
kidney infection (Horwitz, M. S. 1996. Adenoviruses, p. 2149-2171. In B. N. Fields,
Knipe, D.M., Howley, P.M. (ed.), Virology, vol. 2. Lippincott-Raven Publishers Inc.,
Philadelphia). Therefore, it was not surprising to see that Ad3 and Ad35 recognize
different receptors.

In conclusion, Ad35 and the chimeric vector enter the cells by a CAR- and αv-integrin-
independent pathway. We believe that Ad35 and the chimeric vector bind primarily to
their fiber receptor and that this interaction is sufficient to trigger internalization. On
the other hand, Ad35 internalization may involve cellular proteins other than αv-integrins.
These membrane proteins can overlap with those for Ad3 internalization and represent β2
integrins, which protrude more from the cell surface than αv-integrins (Huang, S., et al.
1996. J. Virology. 70:4502-4508).

According to EM studies of negative contrast-stained adenoviral suspensions, the
percentage of defective particles for all adenoviral serotypes tested does not exceed 5%.
However, plaque assays reveal that the ability to form plaques in 293 cells differs
significantly among the tested serotypes. The optical particle-to-PFU (OPU/PFU) ratio
obtained is 13 for Ad5, which is in good agreement with the previously estimated ratio for
this adenoviral serotype. Importantly, this ratio is about three times higher for adenovirus
serotype 35 and more than 150-fold higher for adenovirus serotype 9. Furthermore,
quantitative Southern blot using chimeric Ad5/9 and Ad5/35 DNA probes is used to
determine the ratio between the genome and transducing titer. This study confirms the
data obtained by plaque assay. Quantitative replication assay of these adenoviruses in
K562 and CD34+ cells also confirms the ability of Ad9 and Ad35 to more efficiently
attach to these cell types. The replication of viral genomes is observed for Ad9 and Ad35
at lower MOIs of infection, compared to Ad5. In conclusion, the data obtained for
different serotypes in attachment and internalization are in good agreement with the
infectivity data in target cells.

E. Attachment and internalization of different adenoviral serotypes into primary
dendritic cells, JAWSII, MCF-7 and REVC cells.

As a proof of principle, the serotype screening strategy can be employed for other
important target cells which are refractory to Ad5 infection.

RESULTS

REVC cells are endothelial cells which have to be targeted for approaches that are aimed
at gene therapy of restenosis, atherosclerosis, inflammation, etc. MCF-7 cells are breast
cancer cells isolated from liver metastases which are important targets for tumor gene
therapy. The human adenovirus serotypes 3, 5, 9, 35 and 41 are tested to see whether
they can attach to and can be internalized by mouse primary dendritic cells, JAWSII
cells, MCF-7 human breast cancer cells and REVC endothelial cells. None of the
adenoviral serotypes tested can efficiently attach to primary dendritic cells. Adenovirus
serotype 3 is able to efficiently attach to REVC endothelial cells (about 400 virus
particles per cell are attached and about 300 are internalized). In comparison, only 50
Ad5 particles are able to attach to, and even fewer are internalized in, these REVC cells.
The human breast cancer cells (MCF-7) were previously shown to be refractory to Ad5
infection at low MOIs. However, Ad3 and, more efficiently, Ad35 attach to and
internalize into MCF-7 cells.

DISCUSSION

The data presented herein indicate that different human adenovirus serotypes recognize
different cellular receptors and can therefore infect cell types that are refractory to Ad5
infection. There are adenoviral serotypes that attach to and are internalized by human
CD34+ cells, REVC, K562 and MCF-7 cells more efficiently than Ad5. This finding provides
a basis for the construction of chimeric adenoviral vectors which are Ad5 vectors
containing receptor ligands derived from other serotypes.

F. Infection studies on primary human bone marrow cells. 

Since established erythroleukemic cell lines do not represent an adequate model for the
ultimate hematopoietic stem cell that has to be targeted in patients in order to achieve
long-term reconstitution with genetically modified cells, normal primary human bone
marrow cells are used for the initial infection/retargeting studies.

RESULTS

In a first set of tropism studies with different Ad serotypes, whole bone marrow cell
suspensions can be used without preselection. This is advantageous because the tropism
of various adenovirus serotypes or genetically retargeted vectors can be analyzed on a
broad spectrum of progenitor subpopulations representing myeloid, erythroid,
megakaryocyte, lymphoid, dendritic, and monocytic lineages. For short-term (< 5 hours)
infection studies, bone marrow suspensions can be cultured in IMDM supplemented with
10% FCS, β-mercaptoethanol, and 10 units/ml IL-3 to ensure cell viability.

Mononucleated cell assays:

Mononucleated bone marrow cells can be incubated with MOI 1, 10, 100, or 1000
pfu/cell of the various adenovirus types for a short time. Paraffin sections or cytospins of
infected bone marrow cells can be analyzed for nuclear-localized, labeled viral DNA.
BrdU labeling can be visualized by immunofluorescence with anti-BrdU antibodies; 32P-
tagged viral DNA can be detected by incubation with photo-emulsion. In addition, the
same cell material can be analyzed for morphology after specific histo-staining (e.g.
Wright-, Hemo3 staining). If required, commercially available antibodies to specific cell
surface markers, conjugated directly to different fluorochromes (FITC (green), TRITC,
RPE (red), RPE-Cy5, AMCA (blue)), can be used to completely characterize infected
bone marrow subpopulations. Colocalization of BrdU-labeled viral DNA (e.g. as FITC
signal) with membrane markers signifying infection of specific cell types can be
demonstrated; for example, potential stem cells/early progenitors (CD34+, CD38-),
megakaryocytes (CD41a+), erythroid cells (glycophorin A+), dendritic cells (CD1a+),
monocytes (CD14+), or myeloid cells (CD15+), etc. The morphological analysis of
infected bone marrow subsets gives a first indication of whether specific adenovirus
serotypes can target primitive cell types.

DISCUSSION

Since the different wild-type adenoviruses do not express a uniform marker gene and do
not integrate, and since detection of tagged viral DNA cannot be done on live cells, it is
not possible, at this point, to characterize infected cells for clonogenic or repopulation
capacities. Therefore, adenovirus serotypes for retargeting studies are selected based on
their ability to infect in vitro purified CD34+ cells at low MOIs. This subset of bone
marrow cells is known to contain long-term reconstituting cells. Infection studies with
different adenovirus serotypes can be repeated on purified CD34+ cells (cultured in
IMDM + 10% FCS, β-mercaptoethanol, and 10 units/ml IL-3) as described above.
Purification of CD34+ cells can be performed by direct immunoadherence on anti-CD34
monoclonal antibody-coated plates or on MiniMACS columns as described by
Papayannopoulou (Papayannopoulou, T. et al., 1996, Experimental Hematology, 24:660-
69; Papayannopoulou, T. et al., 1993, Blood, 81:229). The purity of isolated CD34+
cells ranges routinely from 80-95%. Analogous infection studies can be repeated with
selected adenovirus types on CD34+/CD38- subsets.


To confirm productive infection, purified CD34+ cells can be infected with selected
(methylase-tagged) serotypes and analyzed for viral DNA replication. Cultures of purified
human bone marrow CD34+ cells can be used for the transduction and integration studies
as a model for HSCs.


It was recently demonstrated that HSC activity does exist in CD34-negative human bone
marrow subsets (Bhatia, M. et al., 1998, Nature Medicine, 4:1038-45; Osawa, M., et al.,
1996, Science, 273:242-5; Goodell, M. et al., 1997, Nature Medicine, 3:1337-45; Zanjani,
E. D. et al., 1998, Exp. Hematology, 26:353-60). Lin-CD34-CD38- cells can be tested in the
retargeting and transduction studies in combination with repopulation assays in SCID-
NOD mice.

G. Cloning and insertion of the fiber gene. 

METHODS

PCR-cloning of the corresponding fiber gene and insertion into Ad5-based shuttle
plasmids instead of the endogenous Ad5 fiber:

One or several adenoviruses with tropism for CD34+ or other HSC-containing populations
are selected for further studies described herein. The complete coding region for fiber
varies between 1-2 kb, depending on the virus type. The fiber encoding sequences can be
obtained by PCR with Pfu polymerase from viral DNA isolated from purified particles of
the selected virus types. The corresponding primers can be designed based on the fiber
sequences available from the EMBL gene bank. The PCR products are cloned as PacI-
BalI fragments into pCD4 (Figure 10), a shuttle vector for recombination in RecA+ E.
coli. In pCD4, the heterologous fiber gene is flanked on both sides with Ad5 sequences,
which are homologous to regions directly adjacent to the fiber reading frame in Ad5. As
an Ad5 (shuttle vector) derived template for recombination, pCD1, a pBHG10
(Microbix, Toronto, Canada) derivative, can be used. The recombination procedure is
performed according to a protocol routinely used for recombinant adenovirus generation
(Chartier, C., et al., 1996, J. of Virology, 70, 4805-4810). Routinely, 90% of the resulting
plasmids are accurately recombined. The junctions between the heterologous fiber (X)
and Ad5 sequences can be sequenced to confirm the accuracy of recombination. The
resulting plasmid is named pAd5fiberX (pAd5fx). The resulting product is used to
generate pAd5fx-based Ad.AAV containing the heterologous fiber gene.

Construction of chimeric Ad vectors: 

For transduction studies, two Ad vectors were constructed: Ad5GFP and Ad5GFP/F35,
the latter containing a chimeric Ad5/35 fiber gene. Both adenoviral vectors contained a
2.3 kb CMV promoter-driven EGFP gene [derived from pEGFP-1 (Clontech, Palo Alto, CA)]
inserted into the E3 region of Ad5. The EGFP expression cassette was cloned between
Ad5 sequences 25,191-28,191 and 30,818-32,507 into a shuttle plasmid, which contained
the E3 deletion described for pBHG10 (Microbix, Toronto, Canada). The resulting
plasmid was named pAdGFP. For the chimeric vector, the Ad5 fiber gene in pAdGFP
was substituted by an Ad5/35 chimeric fiber gene generated by the two-step PCR
protocol outlined above. In the first PCR step, three DNA fragments corresponding to i)
the Ad5 fiber 5'-nontranslated region and the first 132 bp of the fiber tail domain (nt
30,818-31,174), ii) the Ad35 shaft and knob domains (nt 132-991), and iii) the Ad5 E4
region including the Ad5 fiber polyadenylation signal (nt 32,775-33,651) were amplified
by Pfu-Turbo DNA polymerase. The following primers were used: for the Ad5 tail,
Ad5F-2 (nt 30,798-30,825) 5'-CGC GAT ATC GAT TGG ATC CAT TAA CTA-3' and
Ad5R-2 (nt 31,174-31,153) 5'-CAG GGG GAC TCT CTT GAA ACC CAT T-3'; for the
Ad35 shaft and knob, primers Ad5/35F and Ad5/35R (see above); for the Ad5 E4 and
polyA, primers Ad5F-1 and Ad5R-1 (see above). After 10 PCR cycles, the products were
purified by agarose gel electrophoresis, combined, and then subjected to a second PCR
with primers Ad5F-2 and Ad5R-1. The resulting 2115 bp-long chimeric fiber gene
contained the Ad5 tail and the Ad35 shaft and knob domains. This product was used as a
substitute for the SalI/XbaI Ad5 fiber gene-containing fragment in pAdGFP. The
resulting plasmid was named pAdGFP/F35. To generate full-length E1/E3-deleted vector
genomes, pAdGFP and pAdGFP/F35 were inserted in pAdHM4 (Mizuguchi, H., Kay,
M.A. 1998. Human Gene Therapy, 9:2577-2583) by recombination in E. coli (Chartier,
C., et al. 1996. Journal of Virology. 70:4805-4810). To do this, the RecA+ E. coli
strain BJ5183 was co-transformed with pAdHM4 linearized by SrfI mixed with the XbaI
fragments containing the GFP genes, the Ad5 or Ad5/35 fiber genes, and the Ad5
homology regions. The resulting recombinants were analyzed by restriction analysis.
Correct recombinants were amplified in E. coli HB101 and purified by double CsCl
gradient banding. The plasmids were named pAd5GFP and pAd5GFP/F35. The correct
structure of the Ad5/35 chimeric fiber gene was confirmed by endonuclease digestion and
sequencing part of pAd5GFP/F35. To produce the corresponding viruses, pAd5GFP and
pAd5GFP/F35 were digested with PacI to release the viral genomes and transfected onto
293 cells as described (Lieber, A., C.-Y. et al. 1996. Journal of Virology. 70:8944-8960).
Plaques developed 7 to 10 days post-transfection in overlaid cultures. Recombinant
viruses were propagated in 293 cells and purified by standard methods described
elsewhere (Lieber, A., C.-Y. et al. 1996. Journal of Virology. 70:8944-8960).
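
Joining the Ad5 tail fragment to the Ad35 shaft and knob fragment in the second PCR
requires that the two first-round products overlap at the tail/shaft junction. The Python
sketch below checks, from the printed primer sequences, that the Ad5 tail reverse primer
(Ad5R-2) is the reverse complement of the 5' end of the chimeric Ad5/35F primer. The
helper names are hypothetical and serve only to illustrate the overlap.

```python
# Translation table for complementing DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(primer: str) -> str:
    """Remove the triplet spacing used in the text and upper-case the sequence."""
    return primer.replace(" ", "").upper()


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


# Primer sequences as printed in the text (5' to 3').
AD5R_2 = clean("CAG GGG GAC TCT CTT GAA ACC CAT T")
AD5_35F = clean(
    "AAT GGG TTT CAA GAG AGT CCC CCT GGA GTT CTT ACT TTA AAA TGT TTA ACC CCA")


def junction_overlap_length(tail_reverse: str, chimeric_forward: str) -> int:
    """Length of the shared sequence if the tail primer mirrors the chimeric primer's 5' end."""
    mirrored = reverse_complement(tail_reverse)
    return len(mirrored) if chimeric_forward.startswith(mirrored) else 0


if __name__ == "__main__":
    print("tail/shaft junction overlap (nt):",
          junction_overlap_length(AD5R_2, AD5_35F))
```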


Hemagglutination assay:

Twenty-five microliters of serial dilutions of Ad5, Ad35, or chimeric Ad5GFP/F35
virions in McIlvaine-NaCl buffer (0.1 M citric acid, 0.2 M Na2HPO4 [pH 7.2], diluted
1:50 with 0.87% NaCl) were loaded onto 96-well plates. To each dilution, 25 µl of a 1%
suspension of monkey erythrocytes (in McIlvaine-NaCl buffer) was added. The
sedimentation pattern was determined after incubation for 1 hour at 37°C. All tests were
performed in quadruplicate in at least two independent experiments.
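
An HA result reported as a dilution (for example "up to 1:512") is the highest dilution at
which agglutination is still seen. The Python sketch below shows how such a titer can be
read from a dilution series. It assumes two-fold dilution steps, which the text does not
state (it says only "serial dilutions", although 512 = 2^9 is consistent with two-fold
steps), and the per-well agglutination calls are hypothetical.

```python
def twofold_dilutions(n_wells: int, start: int = 1) -> list:
    """Reciprocal dilutions for an assumed two-fold series: 1, 2, 4, ..."""
    return [start * 2 ** i for i in range(n_wells)]


def ha_titer(agglutinated: list, dilutions: list):
    """Reciprocal of the highest dilution scored positive, or None if all wells are negative."""
    titer = None
    for dilution, positive in zip(dilutions, agglutinated):
        if positive:
            titer = dilution
    return titer


if __name__ == "__main__":
    dilutions = twofold_dilutions(12)            # 1:1 up to 1:2048
    calls = [True] * 10 + [False] * 2            # hypothetical plate reading
    print("HA titer: 1:%s" % ha_titer(calls, dilutions))
```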

Southern blot:

Extraction of genomic DNA, labeling of DNA fragments and hybridization were
performed as described earlier (Lieber, A., C.-Y. et al. 1996. Journal of Virology,
70:8944-8960).

RESULTS 

Construction/Characterization of chimeric fiber: 


Previously, it has been shown that exchanging the fiber knob was sufficient to alter the
tropism of chimeric Ad vectors (Chillon, M., et al. 1999. J. Virology. 73:2537-2540;
Krasnykh, V., et al. 1998. J. Virology. 72:1844-1852; Stevenson, S. C., et al. 1997.
J. Virology. 71:4782-4790). As outlined above, the length of the fiber shaft may critically
determine the entry strategy of a particular serotype. Therefore, we decided to replace not
only the Ad5 fiber knob but also the shaft. The chimeric Ad5/35 fiber contained the Ad5
tail (amino acids 1-44) necessary for interaction with the Ad5 penton base, linked to 279
amino acids from Ad35 including the shaft with 7 β-sheets and the knob (Fig. 16A). The
endogenous Ad5 fiber polyA signal was used to terminate transcription of the chimeric
fiber RNA. The combination of the Ad5 capsid, including the RGD motif-containing
penton base, with a short-shafted fiber could be risky because the natural distance
between the fiber knob and the RGD motifs was disturbed. The Ad5 fiber was
substituted by the chimeric fiber sequences in an E1/E3-deleted Ad vector. This
vector carried a CMV promoter-GFP reporter gene cassette inserted into the E3 region.
The corresponding chimeric virus (Ad5GFP/F35) was produced in 293 cells at a titer of
>2 x 10^12 genomes per ml. For comparison, an E1/E3-deleted Ad vector containing the
original Ad5 fiber gene and the GFP expression cassette was generated (Ad5GFP). The
titer and the ratio of physical to infectious particles were similar between Ad5GFP and
Ad5GFP/F35, indicating that the fiber modification did not significantly alter the stability
and/or growth properties of the chimeric vector. The correctness of the fiber modification
was confirmed by restriction analysis of the Ad5GFP/F35 viral genome followed by
Southern blot hybridization (Fig. 16B), direct sequencing of the fiber-coding region, and
a functional test for hemagglutination (HA) of monkey erythrocytes. The agglutination of
erythrocytes is fiber knob-mediated; it is known that Ad5 does not agglutinate monkey
erythrocytes whereas Ad35 efficiently does (Pring-Akerblom, P., et al. 1998. J. Virology.
72:2297-2304). In HA tests, Ad5GFP/F35 agglutinated monkey erythrocytes with the
same efficiency as Ad35 at dilutions of up to 1:512. In contrast, no hemagglutination was
observed with equivalent Ad5 dilutions. This clearly confirmed the functional activity of
the chimeric Ad5/35 fiber incorporated into the Ad5 capsid.

Generation of chimeric adenoviral vectors (Ad. AAV**) with heterologous fiber
molecules: Adenoviruses with chimeric Ad5-Ad3 fiber are viable and can be produced at
high titers (Krasnykh, V., et al., 1996, J. of Virology, 70, 6839-6846; Stevenson, C. et
al., 1997, J. Virology, 71:4782-90). In order to test whether the fiber substitution
described herein affects production or stability of adenoviruses, two E1-deleted
first-generation adenoviral vectors are produced with the AAV-βgal cassette in 293 cells
using standard protocols. One vector is generated by recombination of pAd.AAV-BG
(Fig. 17) with pCD1 (containing the endogenous Ad5 fiber); the other vector (with
heterologous fiber) is the recombination product of pAd.AAV-βGal and pAd5fiberX
(pAd5fx). Virus from single plaques is amplified on 293 cells. The production yield per
293 cell can be determined by plaque-titering of 293 cell lysates. It is anticipated that
the fiber modification will not critically affect the stability of chimeric vectors. Finally,
bone marrow cells can be infected with the retargeted vectors. Two days after infection,
live-cell cytometry is performed for β-gal expression using fluorescein
di-β-D-galactopyranoside (FDG) as substrate (Cantwell, M.J. et al., 1996 Blood 88,
4676-4683; Neering, S. et al., 1996, Blood, 88:1147-55; Fiering, S. N. et al., 1991,
Cytometry, 12:291; Mohler, W. et al., 1996, PNAS, 93:57) and the infected cells are
characterized for morphology and surface markers. Before and during infection, bone
marrow cells can be cultured in IMDM/FCS supplemented with thrombopoietin (Tpo),
which supports the survival of HSC (Matsunaga, T. et al., 1998, Blood, 92:452-61;
Papayannopoulou, T. et al., 1996, Experimental Hematology, 24:660-69). Alternatively,
retargeted vectors can be generated with the AAV-GFP (green fluorescent protein)
cassette, and FACS analysis can be performed on transduced cells based on GFP and
surface marker expression.

H. Competition studies of chimeric fiber protein Ad5/35. 

Competition studies: 

Cross-competition studies between Ad5, Ad35, and Ad5GFP/F35 (Fig. 18) for binding and
internalization were performed in order to investigate in more detail the pathways which
are used by the chimeric vector to infect target cells. Wild-type Ad35 and the chimeric
vector Ad5GFP/F35 could recognize the same primary receptor, as they competed with
each other for attachment to K562 cells (Fig. 19A, upper panel). This primary
receptor is different from that used by Ad5, since neither Ad5 viral particles nor anti-
CAR monoclonal antibodies (Fig. 19B, upper panel) were able to abrogate Ad35 or
Ad5GFP/F35 binding. In competition studies for internalization, Ad35 and Ad5GFP/F35
competed with each other with equal efficiency. Ad5 and anti-αv-integrin monoclonal
antibodies (L230) (Figs. 19C, D; lower panel) did not inhibit internalization of Ad35 or
the chimeric virus. To consolidate these data, K562 cells were infected with Ad5GFP and
Ad5GFP/F35 after prior incubation of cells with anti-CAR or anti-αv-integrin
monoclonal antibodies, followed by analysis of GFP-expressing cells. The transduction
data mirror the results obtained in adsorption/internalization studies. In summary, this
demonstrated that Ad35 and Ad5GFP/F35 use a CAR- and αv-integrin-independent
pathway for infection of K562 cells; the structural elements which account for these
specific properties are located within the Ad35 fiber and can be transplanted into Ad5 by
fiber substitution.

Ad3 can efficiently interact with K562 cells (Fig. 12). Although Ad3 and Ad35 belong to
the same subgroup (B), the homology between the amino acid sequences of their fibers is
only about 60%. Therefore, we decided to test whether Ad3 could compete with Ad35
and Ad5GFP/F35 for attachment and internalization (Fig. 20). These studies
demonstrated that Ad35 binding was not inhibited by Ad3, indicating the use of different
receptors. Interestingly, Ad3 slightly inhibited attachment of Ad5GFP/F35 (Fig. 20A, left
panel). In addition to binding to the receptor common for the Ad35 and Ad5GFP/F35
fiber, the chimeric capsid (e.g. the Ad5 penton RGD motifs) may also interact with a
second cellular receptor that overlaps with elements involved in Ad3 binding. In cross-
competition for internalization, pre-incubation of cells at 37°C with Ad35 and with
chimeric virus significantly decreased internalization of [3H]-labeled Ad3 (Fig. 20D, right
panel). In the reverse experiment, Ad3 as competitor decreased the level of
internalization by 30% for both Ad35 and the chimeric virus (Fig. 20B, right panel). As
expected, Ad5 and Ad3 did not compete for adsorption or internalization. As shown
before (Fig. 19B), anti-CAR and anti-αv-integrin antibodies did not block Ad3 interaction
with K562 cells. In summary, we concluded that Ad35 and Ad5GFP/F35 bind to
receptor/s different from that of Ad3, although they can use common structural elements
for internalization, which are different from αv-integrins.

Infection studies with chimeric virus:

It is established that Ad5GFP/F35 infected K562 cells by a CAR- and αv-integrin-independent
pathway. It is possible that this property allows for efficient transduction of non-cycling
CD34+ cells, which scarcely express CAR and αv-integrins. To test this, the transduction
properties of Ad5GFP and Ad5GFP/F35 vectors were analyzed on CD34+ cells, K562,
and HeLa cells. Fig. 21 shows the percentage of transduced, GFP-expressing cells
depending on the MOI used for infection. Nearly 100% of HeLa cells were transduced
with Ad5GFP and Ad5GFP/F35 at MOIs of ≥25. More than 95% of the K562 cells
were transduced with Ad5GFP/F35 at MOIs of ≥100, whereas the transduction rate
was significantly lower with Ad5, where it increased with the MOI, reaching a plateau at
~70% GFP-positive cells after infection with an MOI of 400. Transduction of CD34+
cells was about threefold more efficient with Ad5GFP/F35 than with Ad5GFP at all
MOIs analyzed. Interestingly, at higher MOIs, the transduction rate did not rise
proportionally with the viral dose and soon reached a plateau, indicating that in both cases
only specific subset/s of CD34+ cells were permissive to infection. In order to
characterize these specific, permissive subset/s in more detail, additional transduction
studies were performed. First, the percentage of GFP-expressing cells was determined in
CD34+ fractions that were stained for αv-integrins or CAR (Fig. 22). The low number of
CAR-positive CD34+ cells complicated accurate co-labeling studies, and there was no
correlation between CAR expression and the proportion of transduced cells among
CD34+ cells infected with Ad5GFP or Ad5GFP/F35. Interestingly, for Ad5GFP, 65% of
all GFP-expressing cells were positive for αv-integrins, whereas less than 22% of GFP-
positive cells infected with the chimeric virus stained positive for αv-integrin expression.
While only 17% of the whole CD34+ population expressed GFP after Ad5GFP infection,
the percentage of GFP-expressing cells in the CD34+/αv-integrin-positive fraction was
50%. This indicates that Ad5GFP vector-mediated GFP expression was preferentially
localized to αv-integrin-positive CD34+ subsets, whereas after infection with the
Ad5GFP/F35 vector, GFP was expressed in a broader spectrum of CD34+ cells, with
most of them being αv-integrin-negative.

Next, transduced cells were simultaneously analyzed for GFP as well as for CD34 and
CD117 markers. As mentioned before, only about 90% of all cells used in our analysis
were positive for CD34 at the time of infection, hence the multiparameter analysis for
CD34 and GFP. A population of CD34+ cells is extraordinarily heterogeneous in
morphology and stem cell capacity. The subpopulation of CD34+ and CD117+ cells
resembles very primitive hematopoietic cells (Ikuta, K., Weissman, I.L. 1992 Proc. Natl.
Acad. Sci. USA 89:1502-1506; Simmons, P. J., et al. 1994. Expl. Hematology. 22:157-
165). Fig. 23 summarizes the analyses of GFP expression in correlation with these
specific stem cell markers. While 54% of cells infected with the chimeric vector were
positive for both GFP and CD34, only 25% of cells infected with Ad5GFP expressed the
transgene and the CD34 marker (Fig. 23A, lower panel). More importantly, based on GFP
expression, the chimeric virus transduced 80% of c-kit positive cells, whereas the Ad5-
based vector transduced only 36% (Fig. 23A, middle panel). In an additional experiment,
CD34+ cells were sorted for CD117 expression prior to infection with Ad5GFP or
Ad5GFP/F35 and, 24 hours post-infection, GFP expression was analyzed in this specific
fraction (Fig. 23B). This analysis revealed that the chimeric vector transduced 4-fold
more CD34+/CD117+ cells than the Ad5GFP vector.
In conclusion, these results demonstrated that the chimeric Ad5GFP/F35 vector was
clearly superior to the Ad5GFP vector in targeting and transduction of CD34+ cells.
Furthermore, the data suggest that the spectrum of CD34+ cell subsets permissive for Ad
infection was significantly different for the chimeric vector than for the Ad5 vector.

Analysis of viral genomes within CD34+ cells infected with the Ad5 and chimeric 
vectors: 

So far, the transduction rate of CD34+ cells was measured based on GFP expression after
infection with Ad5GFP and Ad5GFP/F35. Considering the extraordinary heterogeneity of
CD34+ cells in morphological and functional parameters, GFP may not be expressed in
all cell types that were efficiently infected. Possible reasons are that the CMV
promoter may not be active in all cell types or that the regulation of transgene expression
could differ between subsets on a post-transcriptional or post-translational level. To test
this, we quantified the number of intracellular (transduced) viral genomes within GFP-
positive and GFP-negative fractions of CD34+ cells infected with Ad5GFP and
Ad5GFP/F35. To do this, twenty-four hours after infection, CD34+ cells were sorted into
GFP-positive and GFP-negative fractions, which were subsequently used to isolate
genomic DNA together with transduced viral DNA. The number of viral genomes was
determined by quantitative Southern blot as described for Fig. 15. Per GFP-positive
CD34+ cell, about 270 copies of the Ad5GFP/F35 viral genome were detected.
Interestingly, a remarkable 200 copies of the Ad5GFP/F35 viral genome were found per
GFP-negative CD34+ cell (Fig. 24A and 25). This demonstrated that not all infected cells
expressed GFP and implies that the actual transduction rate was higher than 54% (GFP-
positive cells). We concluded that the CMV promoter was not active in all transduced
CD34+ subsets. No Ad5GFP vector-specific signal was detected within infected CD34+
(GFP-positive or GFP-negative) fractions by Southern blot, which had a detection limit of
14 viral genomes per cell. From this, we can conclude that the vector DNA concentration
per transduced cell was at least 20 times higher for Ad5GFP/F35 than for Ad5GFP.
Ad5GFP DNA was only detectable in DNA samples from infected CD34+ cells by
Southern blot after prior PCR amplification with vector-specific primers (Fig. 24B and
25). This indicates that the replication-deficient Ad5 vector is present but at a very low
copy number, which may be limited by intracellular genome stability. Using the PCR-
Southern detection method, Ad5 vector DNA was also detected in GFP-negative cells,
supporting the view that the CMV promoter may not have been the optimal choice for
transduction studies. It is notable that studies by others on viral genome analyses after
infection of CD34+ cells with Ad5 vectors were performed only after prior PCR
amplification (Mitani, K., et al. 1994. Human Gene Therapy. 5:941-948; Neering, S. J.,
et al. 1996. Blood. 88:1147-1155).
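
The fold difference quoted above can be checked with simple arithmetic. The following
Python sketch is illustrative only; the function name is hypothetical, and the only
inputs are the copy numbers and the detection limit stated in the text. Because no
Ad5 vector signal was seen, the detection limit serves as an upper bound for the Ad5
copy number, so the ratio is a lower bound.

```python
def min_fold_difference(copies_detected: float, detection_limit: float) -> float:
    """Lower bound on the copy-number ratio when the comparison sample
    gave no signal, i.e. it lies below the detection limit."""
    if detection_limit <= 0:
        raise ValueError("detection limit must be positive")
    return copies_detected / detection_limit


# Values taken from the text: ~270 (GFP-positive) and ~200 (GFP-negative)
# chimeric vector genomes per cell; Southern blot limit of 14 genomes per cell.
DETECTION_LIMIT = 14
fold_gfp_positive = min_fold_difference(270, DETECTION_LIMIT)
fold_gfp_negative = min_fold_difference(200, DETECTION_LIMIT)

print(f"GFP-positive fraction: at least {fold_gfp_positive:.1f}-fold")
print(f"GFP-negative fraction: at least {fold_gfp_negative:.1f}-fold")
```

For the GFP-positive fraction this gives a lower bound of roughly 19-fold, which is in
line with the approximately 20-fold difference stated above.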

DISCUSSION 

The chimeric Ad5GFP/F35 vector has binding and internalization properties similar to
Ad35. Therefore, the fiber substitution was sufficient to swap cell tropism from Ad5 to
Ad35. The Ad5GFP/F35 capsid chimera contained the short-shafted Ad35 fiber
incorporated into an Ad5 capsid, instead of the naturally occurring long-shafted Ad5
fiber. During Ad5 infection, interaction between the penton base and integrins is
required to induce viral internalization. For this interaction, the length of the fiber shaft
and the precise spatial arrangement of knob and RGD motifs are critical for the virus
entry strategy. The natural spatial arrangement is disturbed when short-shafted heterologous
fibers are inserted into the Ad5 capsid. Interestingly, the Ad5/35 capsid chimera allows
for efficient infection, suggesting that the protruding RGD motifs in the Ad5 penton
base do not affect the interaction with the primary Ad35 receptor. So far, most of the
chimeric viruses were generated by substituting only the Ad5 knob while maintaining the
long Ad5 fiber shaft (Chillon, M., et al. 1999. J. Virology. 73:2537-2540; Krasnykh, V.
N., et al. 1996. J. Virology. 70:6839-6846; Stevenson, S. C., et al. 1995. J. Virology.
69:2850-2857; Stevenson, S. C., et al. 1997. J. Virology. 71:4782-4790). The exception
was an Ad5/7 chimeric virus (Gall, J., et al. 1996. J. Virology. 70:2116-2123), where the
whole Ad5 fiber was substituted by the short-shafted Ad7 fiber. However, similar to the
parental Ad5, the Ad5/7 chimera still required αv-integrins for infection.
This Ad5GFP/F35 chimera is the first demonstration that, despite the presence of RGD
motifs within the Ad5 penton, the chimeric virus uses cell entry pathways determined
primarily by the receptor specificity of the short-shafted heterologous fiber. This does not
exclude that interaction with a secondary receptor may increase binding affinity. The
latter is supported by the observation that Ad35 and Ad5GFP/F35 slightly differed in
their ability to compete with Ad5 or Ad3 for binding. It is possible that Ad5/35
attachment involves, in addition to the high affinity fiber binding, interaction between
Ad5 capsid proteins (e.g. RGD motifs) and secondary receptor/s that overlap with those
used by Ad3 and Ad5.

These data indicate that infection with Ad5-based vectors is restricted to a specific subset
of CD34+ cells. The percentage of GFP-expressing cells after Ad5GFP infection of
CD34+ cells reached a plateau at MOIs higher than 100, indicating that only a limited
fraction of CD34+ cells was permissive to Ad5. Also, strong replication of wild type Ad5
in infected CD34+ cells may be the result of preferential transduction of a specific
subpopulation of CD34+ cells, resulting in expression of early viral genes at a level
sufficient to initiate viral replication. The presence of a specific subpopulation of CD34+
cells permissive to Ad5-vector infection was suggested by others (Byk, T., et al. 1998.
Human Gene Therapy. 9:2493-2502; Neering, S. J., et al. 1996. Blood. 88:1147-1155). In
the present report, we further characterized this subpopulation and demonstrated that
Ad5-based vectors preferentially infected αv-integrin-positive CD34+ cells. Integrins
(including αv) are thought to be important for homing and trafficking of transplanted
hematopoietic cells; however, little is known about the correlation between αv-integrin
expression and the differentiation status of hematopoietic cells (Papayannopoulou, T.,
Craddock, C. 1997. Acta Haematol. 97:97-104; Roy, V., Verfaillie, C.M. 1999. Exp.
Hematol. 27:302-312). There was no clear correlation between CAR and GFP expression,
suggesting that Ad5GFP may be able to use another membrane protein as a primary
receptor. Alternatively, Ad5GFP transduction observed at an MOI of 200-400 could be
the result of direct interaction between virus and αv-integrins triggering internalization,
which may be the preferred pathway in the absence of CAR (Legrand, V., et al. 1999. J.
Virology. 73:907-919). Importantly, infection with the chimeric Ad5GFP/F35 vector was
not restricted to the αv-positive CD34+ subpopulation.

Among CD34+ cells, the subpopulation of CD34+ and CD117+ cells resembles very
primitive hematopoietic cells (Ikuta, K., Weissman, I.L. 1992 Proc. Natl. Acad. Sci. USA.
89:1502-1506; Simmons, P. J., et al. 1994. Expl. Hematology. 22:157-165). The receptor
for stem cell factor, CD117 (c-kit), belongs to a tyrosine kinase family. It was previously
shown that c-kit+, CD34+ cord blood cells contain a high fraction (16%) of
hematopoietic progenitors (Neu, S., et al. 1996. Leukemia Research. 20:960-971). Early
in ontogeny, CD34+/CD117+ cells have long-term repopulating activity (Sanchez, M. J., et
al. 1996. Immunity. 5:513-525). An average of 50-60% of CD34+ cells are reported to be
CD117 positive (Ikuta, K., Weissman, I.L. 1992 Proc. Natl. Acad. Sci. USA. 89:1502-
1506; Neu, S., et al. 1996. Leukemia Research. 20:960-971; Simmons, P. J., et al. 1994.
Expl. Hematology. 22:157-165). In our studies, the chimeric vector expressed GFP in
54% of CD34+ cells and 80% of CD34+/c-kit+ cells. The actual viral transduction rate
could be even higher because transduced Ad5GFP/F35 vector DNA was also found in
GFP-negative fractions of infected cells. This indicates that the CMV promoter used to
drive GFP expression in our vectors was not active in all transduced cells. We selected
the CMV promoter for transgene expression based on published data demonstrating that
PGK and CMV promoters allowed for efficient transgene expression in CD34+ cells,
whereas the HTLV-I and RSV promoters were almost inactive (Byk, T., et al. 1998.
Human Gene Therapy. 9:2493-2502; Case, S. S., et al. 1999. Proc. Natl. Acad. Sci. USA.
96:2988-2993). On the other hand, studies by Watanabe et al. (Watanabe, T., et al. 1996.
Blood. 87:5032-5039) suggest that the CMV promoter is not active or is rapidly silenced in
certain CD34+ subsets. Our data underscore this observation. Considering retroviral
transduction studies, the retroviral MLV promoter may have been a better candidate for
transduction studies in hematopoietic cells (Bregni, M., et al. 1998. Gene Therapy. 5:465-
472).

After having demonstrated that the Ad5GFP/F35 vector efficiently transduced cells
carrying stem cell specific markers, the next logical step would be to perform colony
assays with pre-sorted GFP positive/negative cells. However, this assay is complicated by
the fact that infection with first generation Ad vectors is cytotoxic and affects the
formation and growth of progenitor colonies in MC-cultures (Mitani, K., et al. 1994.
Human Gene Therapy. 5:941-948; Watanabe, T., et al. 1996. Blood. 87:5032-5039). This
side effect is caused by the expression of Ad proteins within transduced cells (Lieber, A.,
C.-Y. et al. 1996. Journal of Virology. 70:8944-8960; Schiedner, G., et al. 1998. Nature
Genetics. 18:180-183; Yang, Y., et al. 1994. Proc. Natl. Acad. Sci. USA. 91:4407-4411).
Some of these proteins (e.g., E4-orf4, pTP, or E3-11.6k) have pro-apoptotic activity
(Langer, S. J., Schaak, J. 1996. Virology. 221:172-179; Lieber, A., et al. 1998. J.
Virology. 72:9267-9277; Shtrichman, R., Kleinberger, T. 1998. J. Virology. 72:2975-
2983; Tollefson, A. E., et al. 1996. J. Virology. 70:2296-2306). Clearly, this would
affect the outcome of transduction studies with Ad5GFP/F35, which allows for the
efficient transfer of viral genomes into CD34+ cells, implying significant expression of
viral proteins. Moreover, recently published data indicate that short-term colony assays
mostly measure mature progenitors and do not represent a rigorous test for transduction
of potential stem cells.

A definitive demonstration that Ad5GFP/F35-based vectors can transduce HSC requires
colony assays or, preferably, repopulation assays in SCID-NOD mice. We can perform
these studies with gutless vectors (Steinwaerder, D. S., et al. 1999. J Virol 73:9303-13)
and integrating ΔAd.AAV vectors devoid of all viral genes (Lieber, A., et al. 1999. J
Virol 73:9314-24) generated based on Ad5GFP/F35 chimeric capsids. Alternatively,
gutless, retargeted vectors could be used to transiently express a retroviral receptor on
CD34+ cells to increase their susceptibility to infection with retroviral vectors, based on
an approach that we have published earlier (Lieber, A., et al. 1995. Human Gene
Therapy. 6:5-11).

Our finding that Ad5GFP/F35 can efficiently transduce hematopoietic cells with potential 
stem cell capacity represents an important step towards stable gene transfer into HSCs 
and gene therapy of blood disorders. Furthermore, the virological aspects of this
invention contribute to a better understanding of adenovirus cell interactions. 
I. Retargeting of Ad5 based vectors with modified fibers carrying specific ligand
peptides for HSC and other cell types

Another alternative to make Ad5-capsid-based vectors suitable for HSC gene therapy is
to incorporate the coding sequence for HSC-specific peptides into the HI loop region of
the Ad5 fiber gene. The modification of the HI loop was successfully carried out by
Krasnykh et al. with a 7 amino-acid long FLAG peptide (DYDDDDK). Using phage
display peptide libraries (Pasqualini, R. et al., 1996, Nature, 380:364-66), Renata
Pasqualini (La Jolla Cancer Research Center) reported recently, at the First Meeting of
the American Society for Gene Therapy, the identification of small peptide ligands
specific for bone marrow cells. The corresponding sequences encoding these peptides
can be added to modify the HI loop sequence employing site-directed mutagenesis.
Optimally, the ligands should allow for the efficient internalization of adenoviral particles
based on a CAR- and integrin-independent pathway. Modified adenoviral vectors
containing the AAVBG cassette can be produced and tested for HSC tropism as
described above.

Adenovirus peptide display:



In order to retarget adenoviruses to any cell type of interest, a strategy is provided which 
involves creating a library of adenoviruses displaying random peptides in their fiber 
knobs as ligands and screening this library for adenovirus variants with tropism to a 
particular cell type in vitro and potentially in vivo. 



The development of the adenovirus peptide display technique is based on the following
ideas. (i) Although the tertiary structure of the Ad5 fiber knob is known, it remains
unclear which domains are involved in receptor binding. There are data suggesting that
receptor-binding domains partially overlap with hemagglutination domains, which are
well characterized for a number of serotypes. Therefore, three intramolecular loop
regions representing potential receptor binding sites can be substituted by random peptide
libraries. Eight amino acid residues in the center of the FG, or GH loops can be
substituted by octameric random peptides (Figs. 26 and 27). These substitutions will
replace CAR tropism and allow for infection of refractory cell types. (ii) To synthesize
the oligonucleotides encoding the peptide library, a novel technique to assemble pre-
synthesized trinucleotides representing the codons for all 20 amino acids is employed.
This avoids termination codons and assures optimal codon usage and translation in
human cells. It is possible to synthesize either a completely randomized library, with all
20 amino acids being incorporated with the same probability, or a partially randomized
library with only three (on average) random amino acid substitutions per octamer at
random positions, which maintains certain critical features of the tertiary
knob structure while introducing variability. The latter model is based on the distribution
of amino acids present in the hypervariable CDR 1 or 2 region of immunoglobulins. (iii)
To maintain a representative library size of about 10^10 different octamers per modified
loop, a new cloning strategy is employed to allow for insertion of the library into the
wild-type Ad5 genome without introducing additional amino acids at the substitution site
and without transformation into bacteria. This strategy is based on a "seamless" cloning
technique available from Stratagene. (iv) In order to produce the library of viruses, viral
genomic DNA containing the modified fiber sequences is transfected into 293 cells
without reduction of the library size. This critical step is done by conjugating the viral
library DNA to carrier Ad5-based adenovirus via poly-lysine to assure 100% transfection
efficiency. This technique allows for coupling of ~1 µg of plasmid DNA (or ~1x10^10
adenoviral genomes) to 10^10 viral particles, which can be used to infect 293/cre cells at an
MOI of 10-100. Importantly, the carrier adenoviral genome has the packaging signal
flanked by lox sites, preventing the packaging of carrier viral DNA after infection of 293
cells that express Cre recombinase (293/cre). This helper virus system is routinely used to
produce so-called gutless adenoviruses. Therefore, the virus progeny represents library
genomes packaged into capsids containing preferentially Ad5 fibers. This is important
for the next infection step into 293 cells at an MOI of 1 to assure a homogeneous fiber
population on the capsid where the fibers are encoded by the packaged genome.
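
The codon-level library design described in (ii) can be illustrated with a short Python
sketch. This is only an in-silico illustration under stated assumptions, not the synthesis
procedure itself: the codon chosen for each amino acid is an assumed, commonly used human
codon, and the function names and the template octamer are hypothetical. The sketch shows
that assembling one fixed codon per amino acid can never produce a termination codon, and
it models both the fully randomized library and the partially randomized library with an
average of three substitutions per octamer.

```python
import random

# One codon per amino acid. The particular codon picked for each residue is an
# illustrative assumption (codons frequently used in human genes).
CODON_FOR = {
    "A": "GCC", "C": "TGC", "D": "GAC", "E": "GAG", "F": "TTC",
    "G": "GGC", "H": "CAC", "I": "ATC", "K": "AAG", "L": "CTG",
    "M": "ATG", "N": "AAC", "P": "CCC", "Q": "CAG", "R": "AGA",
    "S": "AGC", "T": "ACC", "V": "GTG", "W": "TGG", "Y": "TAC",
}
AMINO_ACIDS = sorted(CODON_FOR)
STOP_CODONS = {"TAA", "TAG", "TGA"}
AA_FOR = {codon: aa for aa, codon in CODON_FOR.items()}


def encode(peptide: str) -> str:
    """Assemble the coding sequence from pre-defined trinucleotide units."""
    return "".join(CODON_FOR[aa] for aa in peptide)


def translate(dna: str) -> str:
    """Translate a sequence built from the codon table back into a peptide."""
    return "".join(AA_FOR[dna[i:i + 3]] for i in range(0, len(dna), 3))


def random_octamer(rng: random.Random, length: int = 8) -> str:
    """Fully randomized peptide: each amino acid is equally likely."""
    return "".join(rng.choice(AMINO_ACIDS) for _ in range(length))


def partially_randomized(rng: random.Random, template: str,
                         mean_substitutions: float = 3.0) -> str:
    """Randomize each position with probability mean_substitutions/len(template),
    so that the expected number of randomized positions equals mean_substitutions."""
    p = mean_substitutions / len(template)
    return "".join(rng.choice(AMINO_ACIDS) if rng.random() < p else aa
                   for aa in template)


def theoretical_diversity(length: int = 8) -> int:
    """Number of distinct peptides of the given length over 20 amino acids."""
    return len(AMINO_ACIDS) ** length


if __name__ == "__main__":
    rng = random.Random(0)
    peptide = random_octamer(rng)
    print(peptide, encode(peptide))
    # "GSGSGSGS" is a hypothetical placeholder, not a real loop sequence.
    print(partially_randomized(rng, "GSGSGSGS"))
    print(theoretical_diversity())
```

Under these assumptions, the theoretical diversity of a fully randomized octamer is
20^8 = 2.56x10^10, which is of the same order of magnitude as the library size of about
10^10 octamers per modified loop mentioned in (iii).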

J. Production Of Adenovirus Vectors With Increased Tropism To Hepatocytes

An example of a G-H loop substitution to target Ad5 to hepatocytes was successful.
Preliminary tests demonstrated that two evolutionarily conserved regions within the
malaria circumsporozoite surface protein (CS) termed RI and RII+ mediate specific
interaction with hepatocytes but not with other organs (including spleen, lung, heart and
brain), nor with Kupffer cells, liver endothelial cells or with other regions of the
hepatocyte membrane (Cerami, C. et al., 1992, Cell, 70:1021-33; Shakibaei, M. and U.
Frevert, 1996, J. Exp. Med., 184:1699-711). These regions are conserved among
different species including Plasmodium berghei, P. cynomolgi, and P. falciparum that
infect mouse, monkey and human hepatocytes, respectively (Cerami, C. et al., 1992, Cell,
70:1021-33; Chatterjee, S. et al., 1995, Infect. Immun., 63:4375-81). Peptides derived
from RI (KLKQPG) or RII (EWSPCSVTCGNGIQVRIK) blocked CS binding to
hepatocytes and infection by sporozoites in vivo (Cerami, C. et al., 1992, Cell, 70:1021-
33; Chatterjee, S. et al., 1995, Infect. Immun., 63:4375-81). RI and RII+ peptides were
separately inserted into an Ad5 fiber knob (H-I and G-H loop) containing mutations that
abolish binding to CAR and alpha-v integrins (Kirby, L. et al., 2000, J. Virol., 74:2804-
13; Wickham, T. J. et al., 1995, Gene Ther., 2:750-6). Based on preliminary data, a
short-shafted fiber was used so that the virus entry strategy predominantly depends on the
interaction with the primary (hepatocyte-specific) receptor. The hepatocyte-specific
ligands are flanked by short glycine stretches to provide flexibility and embedded into a
loop formed by two cysteines. This is one of the classical strategies to incorporate ligands
into a protein scaffold (Doi, N. and H. Yanagawa, 1998, Cell Mol. Life Sci., 54:394-404;
Koivunen, E. et al., 1995, Biotechnology (NY), 13:265-70) and to guarantee their
presentation at the protein surface. The biodistribution of the best variants is tested in
vivo in C57Bl/6 mice based on Southern blots or PCR for vector DNA in different
organs. This mouse strain is known to be susceptible to infection with P. berghei
(Chatterjee, S. et al., 1995, Infect. Immun., 63:4375-81).
K. Production Of Adenovirus Vectors With Increased Tropism To Tumor Cells

A similar strategy is to insert two peptides obtained after selection for tumor tropism by
displaying random peptides on filamentous phages. The first, a double cyclic peptide
(RGD-4), proved to bind specifically to integrins present on tumor vasculature (Ellerby, H.
M. et al., 1999, Nat. Med., 5:1032-8). The second peptide targets specific matrix
metalloproteinases associated with metastatic tumor cells, as shown for the breast cancer
cell line MDA-MB-435 (Koivunen, E. et al., 1999, Nat. Biotechnol., 17:768-74).
Tropism-modified vectors are tested in animal models with hepatic metastases derived
from MDA-MB-435 cells (Fig. 28).

L. Development Of A Peptide Display Technique Based On Adenoviruses 

A synthetic peptide library is described that allows adenovirus vectors to express random
peptides in the G-H loop of the fiber knob domain. The technique of a phage display
library is optimized to generate a library of adenoviruses displaying random peptides in
their fiber knob. This library of adenovirus variants is then screened for tropism to a
particular cell type in vitro and potentially in vivo. The oligonucleotides encoding the
peptide library are synthesized using a novel technique to assemble pre-synthesized
trinucleotides representing the codons for all 20 amino acids. This avoids termination
codons and assures optimal codon usage and translation in human cells. To maintain a
representative library size, a new "seamless" cloning strategy is employed that allows for
insertion of the library into the wild-type Ad5 genome without introducing additional
amino acids at the substitution site and without transformation into bacteria. Transfection
into 293 cells is done by conjugating the viral library DNA to carrier Ad5-based
adenovirus via polylysine to assure 100% transfection efficiency. Importantly, the carrier
adenoviral genome has its packaging signal flanked by lox sites, preventing the packaging
of carrier viral DNA after infection of 293 cells that express Cre recombinase (293/cre).
The library is produced with E1-positive viruses depleted for CAR and integrin tropism.
Only variants that have successfully infected the cell type of interest will replicate,
resulting in de novo produced virus. The sequence of the peptide ligand that conferred the
particular tropism will then be analyzed in de novo produced virus.



COMBINATION NOVEL ADENOVIRAL VECTOR AND MODIFIED FIBER 
PROTEIN 

This example describes the following studies which combine the technology of the
integrating adenovirus vector that is devoid of all adenoviral genes with the modified
fiber protein that retargets the vector to quiescent HSC.

A. Transduction studies with retargeted vectors in HSC: 

In order to transduce quiescent HSC and integrate into chromosomal DNA, retargeted
ΔAd.AAVfx vectors are tested for reporter gene expression and vector integration, while
the clonogenic capacity of the transduced cells is analyzed at the same time. The
modified ΔAd.AAVfx hybrid vectors contain genomes devoid of all adenoviral genes (a
"gutless" adenovirus vector) packaged into Ad5 capsids with modified fibers. Rep may be
incorporated into these ΔAd.AAVfx vectors to allow for site-specific integration into
AAVS1.

Transduction studies: 

Purified human CD34+ cells in IMDM/FCS+IL-3 and SCF are infected with different
doses of ΔAd.AAVfx-BG (1-10^7 genomes per cell). CD34+ cells infected with
ΔAd.AAVfx-βGal are cultured for 2 days in suspension, and β-Gal+ cells are sorted by
FACS using FDG as substrate. This determines the infection efficiency. β-Gal expressing
cells are then submitted to clonogenic assays in semi-solid cultures (in two dishes per
MOI) in the presence of multiple cytokines (IL-3, SCF, Epo, G-CSF, GM-CSF, IL-7, Tpo). A
first set of semi-solid cultures can be evaluated after 7 days; another set can be analyzed
after 14 days. Colonies that have formed in semi-solid culture can be characterized by
light microscopy and subsequently stained with X-Gal. Most of the vector genomes should
remain episomal and can be lost with successive cell divisions. Thus, while most cells
can be X-Gal positive at day 2 or day 7 after infection, most of the larger colonies
(analyzed at day 14 p.i.) may not stain homogeneously for β-Gal. A representative number
of X-Gal positive and X-Gal negative colonies can be picked and analyzed for episomal
and integrated vector DNA. The outcome depends on the MOI used for infection and the
integration status of the vector. These studies determine whether hybrid vectors can
infect primitive progenitors.
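
The expectation that large day-14 colonies stain unevenly follows from simple dilution
of non-replicating episomes. The sketch below illustrates this under a deliberately
simplified assumption that is not taken from the text: episomal vector genomes neither
replicate nor integrate, so a colony derived from one cell can never contain more
episome-bearing cells than the founder had episome copies. The function name and the
copy numbers are hypothetical.

```python
# Simplified dilution model (an assumption for illustration, not a measured result):
# episomal genomes do not replicate, so the founder's copies are merely partitioned
# among the daughter cells of the colony.

def max_fraction_with_episome(initial_copies: int, divisions: int) -> float:
    """Upper bound on the fraction of colony cells that still carry at least one
    episomal vector genome after the given number of cell divisions."""
    cells_in_colony = 2 ** divisions
    return min(1.0, initial_copies / cells_in_colony)


if __name__ == "__main__":
    for copies in (1, 10, 100):          # hypothetical episomal copies per founder cell
        for divisions in (2, 7, 12):     # small, medium and large colonies
            frac = max_fraction_with_episome(copies, divisions)
            print(f"{copies:>3} copies, {divisions:>2} divisions: "
                  f"at most {frac:.1%} of cells carry an episome")
```

Under this model, a colony that stains homogeneously after many divisions is unlikely to
be explained by episomal vector alone, which is why both X-Gal positive and X-Gal
negative colonies are analyzed for integrated vector DNA.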

Detailed characterization of hybrid vector integration:

CD34+ cells can be infected with ΔAd.AAVfx-SNori (MOI 1-10^7) and subjected to G418
selection in methyl cellulose (MC) cultures in the presence of growth factors (IL-3 and
SCF). The resulting colonies are a mixture of mainly myeloid cells. The number and
morphology of G418 resistant colonies can be determined after 2 weeks of selection.
This strategy may be disadvantageous in that the appropriate stem cell may not divide
and form G418 resistant colonies under the specific culture conditions used. Moreover, it
may be difficult to perform G418 selection on a population of heterogeneous cells, which
vary in their sensitivity to G418. Therefore, another set of ΔAd.AAVfx-SNori infected
CD34+ cells can be cultured in methyl cellulose (+IL-3, SCF) without G418 selection.
After 2-3 weeks, single colonies can be picked from both MC cultures (with and without
G418), morphologically characterized, and analyzed for integrated vector using the
modified protocol developed for integration studies in a small number of cells (see Figure
8). This strategy allows the assessment of whether hybrid vectors integrate into the
genome of CD34+ cells cultured in the presence of growth factors. This study also
provides information about potential position effects affecting neo or βgal expression
from integrated vector copies and about the structure of the integrated vector and the
flanking chromosomal regions.

An alternative method to confirm vector integration:

Fluorescence in situ hybridization (FISH) analysis can be performed in individual cells
from MC colonies. CD34+ cells are cultured in MC in the presence of growth factors to
induce cell division and subsequently treated with colchicine. Metaphase chromosome
spreads are analyzed with a biotin-ATP labeled probe specific for the βGal or SNori gene
and a digoxigenin-UTP labeled probe for the human X-chromosome as an internal control
(provided by Christine Disteche, University of Washington). Specific hybridization can
be visualized with corresponding anti-biotin or anti-DIG antibodies labeled with different
fluorochromes (e.g. FITC and Texas Red). Hybrid vector DNA may integrate as
concatemers, which would facilitate detection by FISH. This technique allows one to
localize the chromosomal integration sites of hybrid vectors.

Test transduction into quiescent bone marrow subpopulations: 

Hybrid vectors described so far can be tested to see whether quiescent CD34+ cells can
be stably transduced. To avoid significant cell proliferation, purified CD34+ cells are
cultured in serum free IMDM supplemented with thrombopoietin (Tpo). Tpo alone can
support the survival of stem cells without stimulating their active cell proliferation
(Matsunaga, T. et al., 1998, Blood, 92:452-61; Papayannopoulou, T. et al., 1996,
Experimental Hematology, 24:660-69). To analyze the proliferation status of CD34+ cells
at the time point of infection with the hybrid vector ΔAd.AAVfx-BG, BrdU is added to the
culture medium 2 hours before infection. One set of cells is maintained for two days as
a suspension culture in IMDM containing Tpo only. Another set of cells is grown in
IMDM+Tpo supplemented with multiple cytokines. Forty-eight hours after infection, CD34+
cells can be FACS sorted for beta Gal expression using FDG. FDG positive cells can be
further analyzed for cellular DNA replication based on BrdU incorporation and for
specific CD34+ subset markers. To do this, cytospins from FDG+ cells can be submitted to
immunofluorescence with BrdU specific antibodies and with antibodies to specific cell
surface markers (e.g. CD38, CD41). Alternatively, consecutive paraffin sections of the
same cell can be analyzed for (a) transgene expression by X-Gal staining, (b) DNA
synthesis based on BrdU incorporation, and (c) specific surface markers. This allows one
to confirm that the culture conditions with Tpo alone prevent significant genomic DNA
replication and subsequent cell proliferation, as well as to determine whether quiescent
CD34+ cells can be infected, based on beta Gal expression in cells where BrdU labeling
is absent.

Test hybrid vector integration into quiescent CD34+ cells:

Two sets of CD34+ cells are infected. The first set of ΔAd.AAVfx-SNori infected cells is
cultured for 5-7 days in the presence of cytokines; the other set is cultured without
cytokines. To maintain CD34+ cell viability without cytokines during this period, the
cells are cultured in the presence of Tpo or underlaid with a stromal cell line (AFT024)
(Moore, K. et al., 1997, Blood, 89:4337-47), which can maintain HSC viable for 4 to 7
weeks. After this specific time period, both sets are submitted to clonogenic assays (in
the presence of multiple cytokines) either in combination with G418 selection or without
selection. Single colonies are analyzed morphologically and submitted to genomic DNA
analysis (Figure 8) to determine the vector integration status. The ultimate proof for stem
cell transduction is the in vivo survival/expansion assay. To do this, the CD34+ cells
expressing beta Gal are used for transplantation experiments. If the number of FDG+
cells is not sufficient, total ΔAd.AAVfx-BG infected cells as well as all ΔAd.AAVfx-SNori
infected cells can be used directly without selection. Transplantation can be performed
via tail vein injection into sublethally irradiated SCID NOD mice (Dao, M.A., et al.,
1998, Blood, 4, 1243-1255; Matsunaga, T. et al., 1998, Blood, 92:452-61). At different
time points after transplantation (4 to 8 weeks), mice can be sacrificed to obtain bone
marrow cells, which then can be cultured in suspension until various assays are performed
for X-Gal and cell markers as described earlier. These cells also can be submitted to a
secondary colony assay in MC or secondary transplantation into SCID NOD mice.
Furthermore, MC colonies derived from these cells can be analyzed for the presence of
integrated vector DNA by the method illustrated in Figure 8. The expression and
integration data together allow conclusions about the repopulation efficiency and about
potential position effects.

B. Optimization of ΔAd.AAVfx vectors for γ-globin expression in hematopoietic
stem cells:

One specific example of the invention is (a) to construct a retargeted hybrid vector with
the γ-globin gene as the transgene under the control of an erythroid cell specific
promoter, (b) to analyze the level and kinetics of γ-globin expression after transduction
with hybrid vectors in in vitro and in vivo assays, (c) if required, to protect gene
expression from position effects using γ-globin LCRs or insulators incorporated into
hybrid vectors, and (d) to study whether γ-globin introns or heterologous introns can
increase γ-globin expression.

Another central issue of the invention is to demonstrate that hybrid vectors can
accommodate larger transgenes than rAAV and retroviruses. The insert size limitation of
these vectors is 5kb. Transgene cassettes up to 8kb can be inserted into hybrid vectors as
described. The maximal insert size may be about 14kb if hybrid vectors are produced on
the basis of E2a and/or E4 deleted rAd vectors in corresponding packaging cell lines. The
maximal insert size in hybrid vectors is dictated by the packaging limit of first generation
vectors (Ad.AAV) (<36kb), which are necessary intermediates for hybrid virus production
at large scale. It is expected that the stability and titer of Ad.AAV vectors with an 8kb
globin gene cassette are comparable to those of the vectors containing the 2.5-3.5kb
cassettes used in Ad.AAVBG, Ad.AAV1, and Ad.AAVSNori. The following example experiments
address these issues.
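
The insert size limits stated above can be turned into a quick check of which vector
type could carry a given cassette. This is a minimal sketch: the limits (5kb, 8kb and
about 14kb) are the values quoted in the preceding paragraph, while the dictionary
labels, the function name and the choice to add up only the LCR and gene lengths
(ignoring promoters, ITRs and other elements) are assumptions made for the illustration.

```python
# Insert size limits in kb as quoted in the text; labels are illustrative.
INSERT_LIMITS_KB = {
    "rAAV or retrovirus": 5.0,
    "hybrid vector (first generation intermediate)": 8.0,
    "hybrid vector (E2a/E4 deleted intermediate)": 14.0,  # "about 14kb"
}


def vectors_that_fit(cassette_kb: float) -> list[str]:
    """Return the vector types whose stated insert limit is not exceeded."""
    return [name for name, limit in INSERT_LIMITS_KB.items() if cassette_kb <= limit]


if __name__ == "__main__":
    # Lower-bound cassette sizes: regulatory element plus globin gene only.
    examples = {
        "6.5kb LCR + 1.1kb gamma gene": 6.5 + 1.1,
        "6.5kb LCR + 3.3kb gamma gene": 6.5 + 3.3,
    }
    for label, size in examples.items():
        print(f"{label} ({size:.1f}kb): {vectors_that_fit(size) or 'no listed vector'}")
```

Because the sizes are lower bounds, a cassette close to a limit would still need to be
checked against the full construct length.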

Production of ΔAd.AAVfx with large globin expression cassettes:

In order to improve the condition of sickle cell disease, the expression level of the
transferred γ-globin gene must be at least 50% of that of each endogenous β gene. These
levels of transgene expression can only be achieved by using optimal expression
cassettes, including extended LCRs and intron containing gamma genes (Forrester,
W.C., et al., 1986, Proc. Natl. Acad. Sci. USA 83, 1359-1363; Fraser, P., et al., 1998,
Curr. Opinion in Cell Bio., 10, 361-365; Grosveld, F., et al., 1998, Seminars in
Hematology, 35, 105-111; Martin, D. et al., 1996, Current Opinion in Genetics and
Development, 6:488-95). So far, most of the γ-globin expression cassettes have been
designed for retroviral and rAAV vectors; they are therefore less than 5kb and have to be
devoid of internal splice sites or polyadenylation signals. With the integrating vectors
described herein, it is possible to go beyond this size limitation. This allows one to
improve γ-globin expression in bone marrow cells in terms of an adequate expression
level and long term persistence. For this purpose, γ-globin constructs developed by Li
et al. (Emery, D. W., et al. 1999 Hum Gene Ther 10:877-88; Li, Q., et al. 1999. Blood
93:2208-16) or by Ellis et al. (Ellis, J., et al., 1996, EMBO J., 15, 562-568; Ellis, J.,
et al., 1997, Nucleic Acids Res. 25, 1296-1302) are chosen.

(i) The first cassette contains a γ-globin expression unit used in retroviral vectors.
This allows for a direct comparison between the two systems. This construct includes the
beta promoter from -127 to the beta initiation codon, which is connected in frame with
the gamma coding region. This beta promoter is combined with the 300bp HS40 derived
from the human alpha globin locus, which acts as a strong enhancer for globin
expression. The globin gene is the 1.1kb version with intron 1 and partially deleted
intron 2. A second cassette is generated containing the HS40, the beta promoter, and the
complete 3.3kb gamma globin gene.

(ii) The second construct contains the 6.5kb beta μLCR, which confers a dominant
chromatin opening activity and an adequate level of gamma globin expression in
transgenic mice. The LCR is linked to the short 1.1kb version of the gamma globin gene
or the complete 3.3kb gamma gene.

(iii) Additional globin expression cassettes can be generated which include
insulators, MARs or SARs, as well as other elements that can improve transgene
expression from integrated vectors or in transgenic animals, like introns derived from the
HPRT or hGH genes (Chung, J.H., et al., 1997, Proc. Natl. Acad. Sci. USA 94, 575-580;
Dunaway, M., et al., 1993, Mol. Cell Biol., 17, 182-189; Felsenfeld, G., et al., 1996,
Proc. Natl. Acad. Sci. USA 93, 9384-9388; Klehr, D., et al., 1991, Biochemistry, 30,
1264-1270).



Transduction studies with ΔAd.AAVfx-globin vectors:

Transduction studies with globin-hybrid vectors are performed as described earlier
(Steinwaerder, D. S., et al. 1999. J Virol 73:9303-13). Transduced CD34+ cells are
submitted to differentiation in colony assays or analyzed in in vivo expansion assays in
SCID-NOD mice. MC-colonies or bone marrow cells from experimental mice are
analyzed for globin expression. Gamma-globin expression is measured using fluorescent
anti-gamma-globin antibodies. RNase protection studies can be performed to
specifically quantitate gamma globin mRNA in comparison with. -globin RNA. For these 
studies around 10^4-10^5 cells are needed per test.

Position effects:

In the absence of the LCR, globin genes are subjected to strong position effects when
they are transferred into cultured CD34+ cells or erythroleukemic lines (Fraser, P., et al.,
1998, Curr. Opinion in Cell Bio., 10, 361-365; Grosveld, F., et al., 1998, Seminars in
Hematology, 35, 105-111). Another concern is that site-specific integration of
ΔAd.AAV/rep vectors into AAVS1 may silence transgene expression. If silencing
happens, it can be overcome by incorporating LCRs such as the 6.5kb β-globin μLCR
(Ellis, J., et al., 1996, EMBO J., 15, 562-568; Grosveld, F., et al., 1998, Seminars in
Hematology, 35, 105-111) or insulators into ΔAd.AAV based expression units. Insulators
are DNA elements that protect an integrated reporter gene from chromosomal position
effects or that block enhancer activated transcription from a downstream promoter.
Insulator elements are known for Drosophila melanogaster genes (Gypsy, suppressor of
Hairy wing, scs, scs', Fab-7), for the chicken beta-globin gene (HS4) and for the T cell
receptor (BEAD1; 14, 21.25). Specifically, the Drosophila gypsy or the beta globin
insulator can be inserted as two copies flanking the globin expression cassette into hybrid
vectors. The position effects can be examined in transduced MC-colonies based on the
analysis of integrated vector DNA (see Fig. 29) and gamma-globin mRNA quantification.
Analogous studies can be performed on transduced human bone marrow cells obtained
after transplantation of infected CD34+ cells into SCID-NOD mice.

Intron effects on gamma-globin expression:

A number of reports reveal that the deletion of globin introns, particularly the second
intron of the beta and gamma genes, decreases globin mRNA stability and thus the
expression level (Antoniou, M. et al., 1998, Nucleic Acid Res., 26:721-9). RNA viruses
such as onco-retro-, lenti- and foamy viruses are problematic as vehicles for
intron-containing transgenes. Because ΔAd.AAV is a DNA virus, it should package globin
introns and LCRs if necessary without the decreased titers and rearrangements observed
with retroviral vectors.

APPENDIX I

HUMAN AND ANIMAL ADENOVIRUSES AVAILABLE FROM 
AMERICAN TYPE CULTURE COLLECTION 

1: Adenovirus Type 21 ATCC VR-1099 (NIAID V-221-002-014)
2: SA18 (Simian adenovirus 18) ATCC VR-943 Classification
3: SA17 (Simian adenovirus 17) ATCC VR-942 Classification
4: Adenovirus Type 47 ATCC VR-1309 Classification: Adenov
5: Adenovirus Type 44 ATCC VR-1306 Classification: Adenov
6: Avian adenovirus Type 4 ATCC VR-829 Classification: Ad
7: Avian adenovirus Type 5 ATCC VR-830 Classification: Ad
8: Avian adenovirus Type 7 ATCC VR-832 Classification: Ad
9: Avian adenovirus Type 8 ATCC VR-833 Classification: Ad
10: Avian adenovirus Type 9 ATCC VR-834 Classification: Ad
11: Avian adenovirus Type 10 ATCC VR-835 Classification: A
12: Avian adenovirus Type 2 ATCC VR-827 Classification: Ad
13: Adenovirus Type 45 ATCC VR-1307 Classification: Adenov
14: Adenovirus Type 38 ATCC VR-988 Permit: PHS permit requ
15: Adenovirus Type 46 ATCC VR-1308 Classification: Adenov
16: Simian adenovirus ATCC VR-541 Classification: Adenovir
17: SA7 (Simian adenovirus 16) ATCC VR-941 Classification:
18: Frog adenovirus (FAV-1) ATCC VR-896 Classification: Ad
19: Adenovirus Type 48 (candidate) ATCC VR-1406 Classifica
20: Adenovirus Type 42 ATCC VR-1304 Classification: Adenov
21: Adenovirus type 49 (candidate) ATCC VR-1407 Classifica
22: Adenovirus Type 43 ATCC VR-1305 Classification: Adenov 
23: Avian adenovirus Type 6 ATCC VR-831 Permit: USDA permi 
24: Avian adenovirus Type 3 (Inclusion body hepatitis virus) 
25: Bovine adenovirus Type 3 ATCC VR-639 Classification: A 
26: Bovine adenovirus Type 6 ATCC VR-642 Permit: USDA perm 
27: Canine adenovirus ATCC VR-800 Classification: Adenovir 
28: Bovine adenovirus Type 5 ATCC VR-641 Permit: USDA perm 
29: Adenovirus Type 36 ATCC VR-913 Classification: Adenovi 
30: Ovine adenovirus type 5 ATCC VR-1343 Classification: A 
31: Adenovirus Type 29 ATCC VR-272 Classification: Adenovi 
32: Swine adenovirus ATCC VR-359 Classification: Adenoviru 
33: Bovine adenovirus Type 4 ATCC VR-640 Permit: USDA perm 
34: Bovine adenovirus Type 8 ATCC VR-769 Permit: USDA perm 
35: Bovine adenovirus Type 7 ATCC VR-768 Permit: USDA perm 
36: Adeno-associated virus Type 2 (AAV-2H) ATCC VR-680 Cla
37: Adenovirus Type 4 ATCC VR-4 Classification: Adenovirus
38: Adeno-associated virus Type 3 (AAV-3H) ATCC VR-681 Cla
39: Peromyscus adenovirus ATCC VR-528 Classification: Aden
40: Adenovirus Type 15 ATCC VR-661 Classification: Adenovi
41: Adenovirus Type 20 ATCC VR-662 Classification: Adenovi
42: Chimpanzee adenovirus ATCC VR-593 Classification: Aden
43: Adenovirus Type 31 ATCC VR-357 Classification: Adenovi
44: Adenovirus Type 25 ATCC VR-223 Classification: Adenovi
45: Chimpanzee adenovirus ATCC VR-592 Classification: Aden
46: Chimpanzee adenovirus ATCC VR-591 Classification: Aden 
47: Adenovirus Type 26 ATCC VR-224 Classification: Adenovi 
48: Adenovirus Type 19 ATCC VR-254 Classification: Adenovi 
49: Adenovirus Type 23 ATCC VR-258 Classification: Adenovi
50: Adenovirus Type 28 ATCC VR-226 Classification: Adenovi
51: Adenovirus Type 6 ATCC VR-6 Classification: Adenovirus
52: Adenovirus Type 2 Antiserum: ATCC VR-1079 AS/Rab (NIA
53: Adenovirus Type 6 ATCC VR-1083 (NIAID V-206-003-014)
54: Ovine adenovirus type 6 ATCC VR-1340 Classification: A
55: Adenovirus Type 3 ATCC VR-847 (Derived from NIAID V-20
56: Adenovirus Type 7 ATCC VR-7 Classification: Adenovirus
57: Adenovirus Type 39 ATCC VR-932 Classification: Adenovi
58: Adenovirus Type 3 ATCC VR-3 Classification: Adenovirus
59: Bovine adenovirus Type 1 ATCC VR-313 Classification: A
60: Adenovirus Type 14 ATCC VR-15 Classification: Adenovir
61: Adenovirus Type 1 ATCC VR-1078 (NIAID V-201-001-014)
62: Adenovirus Type 21 ATCC VR-256 Classification: Adenovi
63: Adenovirus Type 18 ATCC VR-1095 (NIAID V-218-003-014)
64: Baboon adenovirus ATCC VR-275 Classification: Adenovir
65: Adenovirus Type 10 ATCC VR-11 Classification: Adenovir
66: Adenovirus Type 33 ATCC VR-626 Classification: Adenovi
67: Adenovirus Type 34 ATCC VR-716 Classification: Adenovi
68: Adenovirus Type 15 ATCC VR-16 Classification: Adenovir
69: Adenovirus Type 22 ATCC VR-257 Classification: Adenovi
70: Adenovirus Type 24 ATCC VR-259 Classification: Adenovi
71: Adenovirus Type 17 ATCC VR-1094 (NIAID V-217-003-014)
72: Adenovirus Type 4 ATCC VR-1081 (NIAID V-204-003-014)
73: Adenovirus Type 16 ATCC VR-17 Classification: Adenovir
74: Adenovirus Type 17 ATCC VR-18 Classification: Adenovir
75: Adenovirus Type 16 ATCC VR-1093 (NIAID V-216-003-014)
76: Bovine adenovirus Type 2 ATCC VR-314 Classification: A
77: SV-30 ATCC VR-203 Classification: Adenovirus, Simian (
78: Adenovirus Type 32 ATCC VR-625 Classification: Adenovi
79: Adenovirus Type 20 ATCC VR-255 Classification: Adenovi
80: Adenovirus Type 13 ATCC VR-14 Classification: Adenovir
81: Adenovirus Type 14 ATCC VR-1091 (NIAID V-214-001-014)
82: Adenovirus Type 18 ATCC VR-19 Classification: Adenovir
83: SV-39 ATCC VR-353 Classification: Adenovirus, Simian (
84: Adenovirus Type 11 ATCC VR-849 (Derived from NIAID V-2
85: Duck adenovirus (Egg drop syndrome) ATCC VR-921 Permi
86: Adenovirus Type 1 ATCC VR-1 Classification: Adenovirus
87: Chimpanzee adenovirus ATCC VR-594 Classification: Aden
88: Adenovirus Type 15 ATCC VR-1092 (NIAID V-215-003-014)
89: Adenovirus Type 13 ATCC VR-1090 (NIAID V-213-003-014)
90: Adenovirus Type 8 ATCC VR-1368 (Derived from NIAID V-20
91: SV-31 ATCC VR-204 Classification: Adenovirus, Simian (
92: Adenovirus Type 9 ATCC VR-1086 (NIAID V-209-003-014)
93: Mouse adenovirus ATCC VR-550 Classification: Adenoviru
94: Adenovirus Type 9 ATCC VR-10 Classification: Adenoviru
95: Adenovirus Type 41 ATCC VR-930 Classification: Adenovi
96: C1 ATCC VR-20 Classification: Adenovirus, Simian (Mast
97: Adenovirus Type 40 ATCC VR-931 Classification: Adenovi
98: Adenovirus Type 37 ATCC VR-929 Classification: Adenovi
99: Marble spleen disease virus (Hemorrhagic enteritis virus
100: Adenovirus Type 35 ATCC VR-718 Classification: Adenovi
101: SV-32 (M3) ATCC VR-205 Classification: Adenovirus, Sim
102: Adenovirus Type 28 ATCC VR-1106 (NIAID V-228-003-014)
103: Adenovirus Type 10 ATCC VR-1087 (NIAID V-210-003-014)
104: Adenovirus Type 20 ATCC VR-1097 (NIAID V-220-003-014)
105: Adenovirus Type 21 ATCC VR-1098 (NIAID V-221-011-014)
106: Adenovirus Type 25 ATCC VR-1103 (NIAID V-225-003-014)
107: Adenovirus Type 26 ATCC VR-1104 (NIAID V-226-003-014)
108: Adenovirus Type 31 ATCC VR-1109 (NIAID V-231-001-014)
109: Adenovirus Type 19 ATCC VR-1096 (NIAID V-219-002-014)
110: SV-36 ATCC VR-208 Classification: Adenovirus, Simian (
111: SV-38 ATCC VR-355 Classification: Adenovirus, Simian (
112: SV-25 (MM ATCC VR-201 Classification: Adenovirus, Sim
113: SV-15 (M4) ATCC VR-197 Classification: Adenovirus, Sim
114: Adenovirus Type 22 ATCC VR-1100 (NIAID V-222-003-014)
115: SV-23 (M2) ATCC VR-200 Classification: Adenovirus, Sim
116: Adenovirus Type 11 ATCC VR-12 Classification: Adenovir
117: Adenovirus Type 24 ATCC VR-1102 (NIAID V-224-003-014)
118: Avian adenovirus Type 1 (Chicken Embryo Lethal Orphan: C
119: SV-11 CMS) ATCC VR-196 Classification: Adenovirus, Sim
120: Adenovirus Type 5 ATCC VR-5 Classification: Adenovirus
121: Adenovirus Type 23 ATCC VR-1101 (NIAID V-223-003-014)
122: SV-27 CMS) ATCC VR-202 Classification: Adenovirus, Sim
123: Avian adenovirus Type 2 (GAL) ATCC V-280 Classificati
124: SV-1 (M1) ATCC VR-195 Classification: Adenovirus, Simi
125: SV-17 (M6) ATCC VR-198 Classification: Adenovirus, Sim
126: Adenovirus Type 29 ATCC VR-1107 (NIAID V-229-003-014)
127: Adenovirus Type 2 ATCC VR-846 Classification: Adenovir
128: SV-34 ATCC VR-207 Classification: Adenovirus, Simian (
129: SV-20 (M7) ATCC VR-199 Classification: Adenovirus, Sim
130: SV-37 ATCC VR-209 Classification: Adenovirus, Simian (
131: SV-33 (M10) ATCC VR-206 Classification: Adenovirus, Si
132: Avian adeno-associated virus ATCC VR-865 Classificatio
133: Adeno-associated (satellite) virus Type 4 ATCC VR-646 
134: Adenovirus Type 30 ATCC VR-273 Classification: Adenovi 
135: Adeno-associated (satellite) virus Type 1 ATCC VR-645 
136: Infectious canine hepatitis (Rubarth's disease, Fox ence
137: Adenovirus Type 27 ATCC VR-1105 (NIAID V-227-003-014)
138: Adenovirus Type 12 ATCC VR-863 (Derived from NIAID V-2
139: Adeno-associated virus Type 2 (molecularly cloned) ATCC
140: Adenovirus Type 7a ATCC VR-848 (Derived from NIAID V-2

What is claimed:

1. A first generation recombinant adenovirus vector a portion of which integrates
into a host cell genome, comprising: 

a) A left adenovirus inverted terminal repeat sequence; 

b) An adenoviral packaging sequence 3' to the left adenovirus inverted 
terminal repeat sequence; 

c) A transgene cassette sequence 3' to the adenoviral packaging sequence; 

d) At least one adenoviral sequence which directs adenoviral replication; and 

e) A right adenoviral inverted terminal repeat sequence, 

wherein the left and right terminal repeat sequences permit integration of the 
transgene cassette sequence into the host cell genome. 

2. The adenovirus vector of claim 1, wherein the left and right adenovirus inverted
repeat sequence and the packaging sequence are from the same adenoviral
serotype.

3. The adenovirus vector of claim 1, wherein the sequence which directs adenoviral
replication comprises a sequence on the anti-parallel strand which encodes an
adenoviral fiber protein including a fiber tail, a fiber shaft, and a fiber knob,
wherein the fiber knob includes a G-H loop region.

4. The adenovirus vector of claim 3, wherein the sequence on the anti-sense strand
which encodes the fiber tail is from the same serotype as the adenoviral inverted
repeat sequence.

5. The adenovirus vector of claim 1, wherein the transgene cassette sequence
comprises:

a) A left cassette inverted terminal repeat sequence;

b) A polyadenylation sequence 3' to the left inverted terminal repeat
sequence;

c) A transgene sequence 3' to the polyadenylation sequence;

d) A promoter sequence 3' to the polyadenylation sequence; and

e) A right cassette inverted terminal repeat sequence 3' to the polyadenylation
sequence.

6. The adenovirus vector of claim 1, wherein the transgene cassette sequence
comprises:

a) A left cassette inverted terminal repeat sequence;

b) A promoter sequence 3' to the left inverted terminal repeat sequence;

c) A transgene sequence 3' to the promoter sequence;

d) A polyadenylation sequence 3' to the transgene sequence; and

e) A right cassette inverted terminal repeat sequence 3' to the polyadenylation
sequence.

7. The adenovirus vector of claim 5 or 6, wherein the left and the right cassette
inverted terminal repeat sequences each comprise an adenoviral-associated
inverted terminal repeat sequence.

8. The adenovirus vector of claim 5 or 6, wherein the transgene sequence is selected
from a group consisting of a therapeutic gene, a selectable gene, and a reporter
gene.

9. The adenovirus vector of claim 8, wherein the therapeutic gene is selected from a
group consisting of gamma globin, and human alpha-1 anti-trypsin.

10. The adenovirus vector of claim 8, wherein the selectable gene is selected from a
group consisting of neomycin, ampicillin, penicillin, tetracycline, and gentamycin.

11. The adenovirus vector of claim 8, wherein the reporter gene is selected from a
group consisting of green fluorescent protein, beta galactosidase, and alkaline
phosphatase.

12. The transgene cassette of claim 5 or 6 further comprising an inverted repeat
sequence located 3' to the left inverted terminal repeat sequence or located 5' to
the right inverted terminal repeat sequence.

13. The transgene cassette of claim 5 or 6, further comprising an insulator element. 

14. The transgene cassette of claim 5 or 6, further comprising a bacterial origin of 
replication. 

15. The adenoviral vector of claim 1, wherein the adenoviral sequences which direct 
adenoviral replication are selected from a group consisting of E2 and E4; E1, E2
and E4; E2 and E4; and E2, E3, and E4. 

16. A first generation recombinant adenovirus vector which targets a host cell of 
interest and a portion of which integrates into the host cell genome so targeted, 
comprising two DNA strands, each strand being antiparallel to the other, the first 
strand comprising: 

a) A left adenovirus inverted terminal repeat sequence; 

b) An adenoviral packaging sequence 3' to the left adenovirus inverted 
terminal repeat sequence; 

c) A transgene cassette sequence 3' to the adenoviral packaging sequence;

d) At least one adenoviral sequence which directs adenoviral replication; and 

e) A right adenoviral inverted terminal repeat sequence, 

wherein the left and right terminal repeat sequences permit integration of the 
transgene cassette sequence into the host cell genome, and wherein the second 
strand comprises a sequence which encodes an adenoviral fiber protein that 
permits targeting of the vector into the host cell of interest.

17. The adenovirus vector of claim 16, wherein the adenoviral protein includes a fiber
tail, a fiber shaft, and a fiber knob, wherein the fiber knob includes a G-H loop
region.

18. The adenovirus vector of claim 16, wherein the left and right adenovirus inverted
terminal repeat sequences and the packaging sequence are from the same 
adenoviral serotype. 

19. The adenovirus vector of claim 17, wherein the fiber tail is from the same 
serotype as the left and right adenoviral inverted repeat sequences. 

20. The adenovirus vector of claim 17, wherein the fiber shaft is from a different 
serotype as the left and right adenoviral inverted repeat sequences. 

21. The adenovirus vector of claim 20, wherein the fiber shaft is from a serotype 
selected from a group consisting of serotype 3, 7, 9, 11, and 35.

22. The adenovirus vector of claim 17, wherein the fiber shaft comprises a shortened 
length. 

23. The adenovirus vector of claim 17, wherein the fiber knob is from a different 
serotype as the left and right adenoviral inverted repeat sequences. 

24. The adenovirus vector of claim 23, wherein the fiber knob is from a serotype 
selected from a group consisting of serotype 3, 7, 9, 11, and 35.

25. The adenovirus vector of claim 17, wherein the fiber knob is a modified fiber 
knob protein comprising the G-H loop replaced with a heterologous peptide 
ligand sequence which binds to at least one surface protein on the host cell of 
interest.

26. The adenovirus vector of claim 16, wherein the transgene cassette sequence
comprises:

a) A left cassette inverted terminal repeat sequence;

b) A polyadenylation sequence 3' to the left inverted terminal repeat
sequence;

c) A transgene sequence 3' to the polyadenylation sequence;

d) A promoter sequence 3' to the polyadenylation sequence; and

e) A right cassette inverted terminal repeat sequence 3' to the polyadenylation
sequence.

27. The adenovirus vector of claim 16, wherein the transgene cassette sequence 
comprises: 

a) A left cassette inverted terminal repeat sequence; 

b) A promoter sequence 3' to the left inverted terminal repeat sequence; 

c) A transgene sequence 3' to the promoter sequence;

d) A polyadenylation sequence 3' to the transgene sequence; and

e) A right cassette inverted terminal repeat sequence 3' to the polyadenylation
sequence. 

28. The adenovirus vector of claim 26 or 27, wherein the left and the right cassette
inverted terminal repeat sequences each comprise an adenoviral-associated 
inverted terminal repeat sequence. 

29. The adenovirus vector of claim 26 or 27, wherein the transgene sequence is 
selected from a group consisting of a therapeutic gene, a selectable gene, and a 
reporter gene. 

30. The adenovirus vector of claim 24, wherein the therapeutic gene is selected from
a group consisting of gamma globin, and human alpha-1 anti-trypsin.

31. The adenovirus vector of claim 24, wherein the selectable gene is selected from a
group consisting of neomycin, ampicillin, penicillin, tetracycline, and gentamycin.

32. The adenovirus vector of claim 24, wherein the reporter gene is selected from a 
group consisting of green fluorescent protein, beta galactosidase, alkaline 
phosphatase. 

33. The transgene cassette of claim 26 or 27 further comprising an inverted repeat 
sequence located 3' to the left inverted terminal repeat sequence or located 5' to
the right inverted terminal repeat sequence. 

34. The transgene cassette of claim 26 or 27, further comprising an insulator element.

35. The transgene cassette of claim 26 or 27, further comprising a bacterial origin of 
replication. 

36. The adenoviral vector of claim 16, wherein the adenoviral sequences which direct 
adenoviral replication are selected from a group consisting of E2 and E4; E1, E2
and E4; E2 and E4; and E2, E3, and E4.

37. A recombinant gutless adenovirus vector a portion of which integrates into a host 
cell genome, comprising: 

a) A left adenovirus inverted terminal repeat sequence;

b) An adenoviral packaging sequence 3' to the left adenovirus inverted
terminal repeat sequence;

c) A transgene cassette sequence 3' to the adenoviral packaging sequence;
and

d) A right adenoviral inverted terminal repeat sequence,

wherein the left and right terminal repeat sequences permit integration of the
transgene cassette sequence into the host cell genome.

38. The adenovirus vector of claim 36, wherein the left and right adenovirus inverted
repeat sequence and the packaging sequence are from the same adenoviral 
serotype, (this is the base vector) 

39. The adenovirus vector of claim 36, wherein the transgene cassette sequence 
comprises: 

a) A left cassette inverted terminal repeat sequence; 

b) A polyadenylation sequence 3' to the left inverted terminal repeat
sequence;

c) A transgene sequence 3' to the polyadenylation sequence;

d) A promoter sequence 3' to the polyadenylation sequence; and 

e) A right cassette inverted terminal repeat sequence 3' to the polyadenylation 
sequence. 

40. The adenovirus vector of claim 36, wherein the transgene cassette sequence 
comprises: 

b) A left cassette inverted terminal repeat sequence; 

c) A promoter sequence 3' to the left inverted terminal repeat sequence; 

d) A transgene sequence 3' to the promoter sequence; 

e) A polyadenylation sequence 3' to the transgene sequence; and 

f) A right cassette inverted terminal repeat sequence 3' to the polyadenylation 
sequence. 

41. The adenovirus vector of claim 38 or 39, wherein the left and the right inverted 
terminal repeat sequences each comprise an adenoviral-associated inverted 
terminal repeat sequence. 

42. The adenovirus vector of claim 38 or 39, wherein the transgene sequence is 
selected from a group consisting of a therapeutic gene, a selectable gene, and a 
reporter gene.

The adenovirus vector of claim 41, wherein the therapeutic gene is selected from
a group consisting of gamma globin, and human alpha-1 anti-trypsin.

The adenovirus vector of claim 41, wherein the selectable gene is selected from a
group consisting of neomycin, ampicillin, penicillin, tetracycline, and gentamycin.

The adenovirus vector of claim 41, wherein the reporter gene is selected from a 
group consisting of green fluorescent protein, beta galactosidase, alkaline 
phosphatase. 

The adenovirus vector of claim 38 or 39 further comprising an inverted repeat 
sequence located 3' to the left inverted terminal repeat sequence or located 5' to 
the right inverted terminal repeat sequence. 

The adenovirus vector of claim 38 or 39, further comprising an insulator element. 

The adenovirus vector of claim 38 or 39, further comprising a bacterial origin of 
replication. 

The adenovirus vector of claim 1, wherein the transgene cassette sequence 
comprises a 5' portion of a gene of interest.

The adenovirus vector of claim 1, wherein the transgene cassette sequence
comprises a 3' portion of a gene of interest.

A method of producing a resolved gutless adenovirus vector in a suitable cell, said 
method comprising introducing a first and a second adenovirus vectors of claim 1 
or 16 into the cell under suitable conditions so that the recombinant adenovirus 
vectors undergo homologous recombination thereby producing a resolved gutless 
adenovirus vector.

52. A resolved gutless adenovirus vector produced by the method of claim 50.

53. The method of claim 50, wherein the first adenovirus vector comprises a
transgene cassette having a 5' portion of a gene of interest, and wherein the
second adenovirus vector comprises a transgene cassette having a 3' portion of
the gene of interest, and wherein a part of the 5' portion overlaps with a part of
the 3' portion so that homologous recombination occurs.

54. A method of producing a resolved gutless recombinant Ad vector by homologous
recombination in a suitable cell, said method comprising contacting two parental
recombinant Ad vectors, each comprising a transgene cassette containing a
portion of a selected transgene with a region of overlapping homology, so that the
first and second parental recombinant Ad vectors undergo homologous
recombination at the region of overlapping homology, resulting in a resolved
recombinant gutless Ad vector having both portions of the selected transgene, and
wherein the selected transgene is within a transgene cassette flanked by a pair of
ITRs.






55. A resolved gutless adenovirus vector produced by the method of claim 53. 

56. An adenovirus library comprising a plurality of adenovirus vectors expressing 
fiber proteins which are displayed and modified by random peptide insertions. 

57. The library of claim 55, wherein said fiber protein so displayed comprises a
random peptide substituted in the G-H loop of the fiber protein knob domain.

58. A screening method for targeting adenovirus vectors for gene therapy comprising
contacting the adenovirus library of claim 55 with a plurality of cells so that the
cells are transduced with the adenovirus vectors of the adenovirus library, and
detecting the cells so transduced.

59. The adenovirus vector of claim 5, 6, 26, 27, 38, or 39 further comprising a
nucleotide sequence encoding a rep78 protein.

VECTORS FOR OCULAR TRANSDUCTION AND USE THEREOF FOR GENETIC THERAPY

Work described herein was supported by NIH grants EY11431 and HL54352. The
government has certain rights in such subject matter.

RELATED APPLICATIONS

This application claims the benefit of priority to U.S. application Serial No.
09/562,934, filed May 1, 2000, to Glen R. Nemerow, Daniel Von Seggern, and
Martin Friedlander, entitled "VECTORS FOR OCULAR TRANSDUCTION AND USE
THEREFOR FOR GENETIC THERAPY".

This application is related to copending U.S. application Serial
No. 09/482,682 (also filed as International PCT application No.
PCT/US00/00265, filed January 14, 2000), to Daniel Von Seggern, Glen R.
Nemerow, Paul Hallenbeck, Susan Stevenson, Yelena Skripchenko, filed January
14, 2000, entitled "Adenovirus Vectors, Packaging Cell Lines, Compositions, and
Methods for Preparation and Use," which is a continuation-in-part of U.S.
Application 09/423,783 filed November 12, 1999 and claims the benefit of the
filing date of U.S. Provisional Application 60/115,920 filed January 14, 1999.
Where permitted, the contents and subject matter of each application and of the
provisional application are incorporated in their entirety herein by reference.

FIELD OF INVENTION

The present invention relates to gene therapy, especially to adenovirus
vector-based gene therapy. In particular, adenovirus vectors for therapy of
ocular diseases and methods of treatment using the vectors are provided.
Compositions, kits, and methods of preparation and use of the vectors for gene
therapy are provided.

BACKGROUND OF THE INVENTION 

Retinal dystrophies 

The eye is susceptible to a number of hereditary and/or age related
degenerative disorders. In the United States, common causes of irreversible
blindness or severe loss of vision are retinal dystrophies (see, e.g., Cotlier et al.
(1995) Surv. Ophthalmology 40:51-61; Bird (1995) Am. J. Ophthal. 119:543-562;
and Adler (1996) Arch. Ophthal. 114:79-83). The retina is the sensory
tunic of the eye, containing light sensitive receptors, a complex of neurons, and
pigmented epithelium, arranged in discrete layers. In humans, the macula is the
portion of the retina that lies directly behind the lens. Cones, the photoreceptor
cells responsible for central vision, are heavily concentrated in the macula.
Central dystrophies, which affect the macula, include Best's disease, age-related
macular degeneration, and Stargardt's macular dystrophy. The peripheral retina
is composed mainly of rods, which are responsible for side and night vision.
Peripheral degenerative retinal diseases include retinitis pigmentosa, choroideremia
and Bietti's crystalline dystrophy.

Macular degenerations are a heterogeneous group of diseases,
characterized by progressive central vision loss and degeneration of the macula
and underlying retinal pigmented epithelium. Age-related macular degeneration
(ARMD) is the most common form of the disease, affecting an estimated 20% of
persons over 75 years of age. ARMD is poorly understood in terms of etiology
and pathogenesis. The very late onset of the disease has made genetic mapping
particularly difficult. Certain macular degenerative conditions with a clear genetic
basis, such as Stargardt's and Best's diseases, share many features with ARMD,
but have been more amenable to molecular and genetic analysis.

Hereditary peripheral retinopathies are also relatively common. Retinitis
pigmentosa (RP), for example, affects approximately 1.5 million people
worldwide. Substantial genetic heterogeneity has been observed in this
condition, with over 20 chromosomal loci identified. A predisposition to retinitis
pigmentosa can be inherited by autosomal dominant, autosomal recessive,
X-linked or digenic mode. Mutations have been identified in seven genes, four of
which encode proteins in the rod phototransduction cascade: rhodopsin, alpha
and beta subunits of rod cGMP phosphodiesterase, and rod cGMP cation-gated
channel protein alpha subunit. Mutations in the peripherin/RDS gene have been
linked to retinitis pigmentosa and macular degeneration. A single peripherin/RDS
mutation apparently caused retinitis pigmentosa, pattern dystrophy and fundus
flavimaculatus, in different family members.

In spite of causal heterogeneity, there is significant clinical similarity
among RP subtypes. Common signs and symptoms include early
electroretinographic abnormalities, ophthalmoscopic findings, and
protracted, contiguous expansion of the ring-like scotoma toward the macula,
leading to progressively worsening tunnel vision. A recent hypothesis is that
active photoreceptor cell death, which is characteristic of these genetically
distinct disorders, is mediated by a common induction of apoptosis. It may be
possible to treat these conditions by the administration of agents that block
induction of apoptosis in photoreceptors, such as neurotrophic factors.

Adenovirus delivery vectors

Adenovirus, which is a DNA virus with a 36 kilobase (kb) genome, is very
well-characterized and its genetics and genetic organization are understood.
The genetic organization of adenoviruses permits substitution of large fragments
of viral DNA with foreign DNA. In addition, recombinant adenoviruses are
structurally stable and no rearranged viruses are observed after extensive
amplification.

Adenoviruses have been employed as delivery vehicles for introducing
desired genes into eukaryotic cells. The adenovirus delivers such genes to
eukaryotic cells by binding to cellular receptors followed by internalization. The
adenovirus fiber protein is responsible for binding to cells. The fiber protein has
two domains, a rod-like shaft portion and a globular head portion that contains
the receptor binding region. The fiber spike is a homotrimer, and there are 12
spikes per virion. Human adenoviruses bind to and infect a broad range of
cultured cell lines and primary tissues from different species.

The 35,000+ base pair (bp) genome of adenovirus type 2 has been
sequenced and the predicted amino acid sequences of the major coat proteins
(hexon, fiber and penton base) have been described (see, e.g., Neumann et al.,
Gene 69: 153-157 (1988); Herisse et al., Nuc. Acids Res. 9: 4023-4041 (1981);
Roberts et al., J. Biol. Chem. 259: 13968-13975 (1984); Kinloch et al., J. Biol.
Chem. 253: 6431-6436 (1984); and Chroboczek et al., Virol. 161: 549-554,
1987).

The 35,935 bp sequence of Ad5 DNA is also known and portions of
many other adenovirus genomes have been sequenced. The upper packaging
limit for adenovirus virions is about 105% of the wild-type genome length (see,
e.g., Bett et al., J. Virol. 67(10): 5911-21, 1993). Thus, for Ad2 and Ad5, this
would be an upper packaging limit of about 38 kb of DNA.
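
The arithmetic behind this limit can be checked with a short sketch. The function name and the integer-percent parameter are illustrative assumptions; only the 105% figure and the 35,935 bp Ad5 genome length come from the text.

```python
def packaging_limit_bp(wild_type_bp: int, percent: int = 105) -> int:
    """Upper packaging limit in base pairs, as a percentage of wild-type length."""
    # Integer arithmetic avoids floating-point rounding; the result is floored.
    return wild_type_bp * percent // 100


AD5_GENOME_BP = 35_935  # Ad5 genome length stated in the text

limit = packaging_limit_bp(AD5_GENOME_BP)
print(f"Ad5 packaging limit: {limit} bp (about {round(limit / 1000)} kb)")
```

For Ad5 this gives a limit a little under 38 kb, in line with the rounded figure quoted above.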

Adenovirus DNA also includes inverted terminal repeat sequences (ITRs)
ranging in size from about 100 to 150 bp, depending on the serotype. The
inverted repeats permit single strands of viral DNA to circularize by base-pairing
of their terminal sequences to form base-paired "panhandle" structures that are
required for replication of the viral DNA.

For efficient packaging, the ITRs and the packaging signal (a few hundred bp in
length) comprise the "minimum requirement" for replication and packaging of a
genomic nucleic acid into an adenovirus particle. Helper-dependent vectors
lacking all viral ORFs but including these essential cis elements (the ITRs and
contiguous packaging sequence) have been constructed.

Ad vectors have several distinct advantages as gene delivery vehicles.
For example, recombination of such vectors is rare; there are no known
associations of human malignancies with adenoviral infections despite common
human infection with adenoviruses; the genome may be manipulated to
accommodate foreign genes of a fairly substantial size; and host proliferation is
not required for expression of adenoviral proteins. Adenovirus (Ad)-based gene
delivery vectors efficiently infect many different cells and tissues. This broad
tropism, however, means that gene delivery cannot be directed to a specific
target cell. A large fraction of intravenously administered adenovirus is
retained by the liver, which could lead to undesirable side-effects. Adenovirus
may potentiate immune responses. For example, Adenovirus type 5 (Ad5) also
transduces dendritic cells, which present antigens very efficiently, thereby
possibly exacerbating the immune response against the vector. It has been
proposed that vectors with different targeting efficiencies might eliminate these
problems, permitting a lower total particle dose and more specific targeting (see,
e.g., U.S. application Serial No. 09/482,682).

The wealth of information on adenovirus structure and mechanism of
infection, its efficient infection of nondividing cells, and its large genetic capacity
make adenovirus a popular gene therapy vector. The wide expression of
receptors to which adenovirus binds makes targeting adenovirus vectors
difficult.

Hence there is a need to improve delivery and targeting of adenoviral
vectors and also to provide treatments for ocular disorders. Therefore, it is an
object herein to provide adenoviral vectors that specifically or selectively target
cells in the eye. It is also an object herein to provide these vectors for treatment
of ocular disorders.

SUMMARY OF THE INVENTION 

Degenerative ocular diseases, such as, but not limited to, retinitis
pigmentosa, Stargardt's disease, diabetic retinopathies, retinal vascularization,
and others (see, e.g., Table below), have a genetic basis. Genes expressed in
the photoreceptor cells at the back of the retina are implicated in these diseases.
Provided herein are recombinant viral vectors for targeting therapeutic products
to these cells.

Recombinant adenoviral vectors that include nucleic acid that permits
specific binding to these photoreceptors are provided. In particular, the vector
particles contain a fiber protein of Ad37 or a modified form thereof. As shown
herein, fiber protein from Ad37 permits efficient infection of photoreceptor cells.
Fiber proteins from other adenovirus D serotypes may also be used. In addition,
the portions of the fiber protein, particularly those that interact with other viral
structural proteins, such as penton, may be modified to resemble the viral source
of the other structural proteins. As exemplified herein, the recombinant viruses
provided herein include Ad5 structural components. The N-terminus of the Ad37
fiber protein, which interacts with the penton protein, is modified to resemble
the Ad5 fiber protein N-terminus to ensure production of viral particles.

The recombinant adenoviral vectors are intended for gene therapy of
diseases in which genes expressed in the photoreceptors are implicated. Such
diseases include, but are not limited to, degenerative ocular diseases, such as
retinitis pigmentosa and Stargardt's disease. These vectors are also useful for
targeting to other ocular cells, such as conjunctival cells, which also bear
receptors to which fiber from Ad37 and related serotypes bind.

The vectors will deliver therapeutic agents to the targeted cells for
treatment of a variety of disorders (see, e.g., Tables 3 and 4, below). The
therapeutic agents are intended for expression in the photoreceptors and for
secretion from the photoreceptor cells, which are surrounded on one side by
choroidal vasculature and on the other side by retinal vasculature, thereby
providing a means for delivery of products. In addition, expression of growth
factors, such as VEGF and others, can be used to enhance blood flow to the
retina and prevent or slow the degeneration.

Therapeutic agents encoded by the recombinant adenoviral vectors
include, but are not limited to, nucleic acid molecules encoding
genes that are defective in certain hereditary disorders; nucleic acid molecules
that encode antiangiogenics and antitumor agents for treatment of retinal
disorders, such as retinoblastomas; nucleic acid molecules encoding trophic
factors, such as glial cell line-derived neurotrophic factor (GDNF) and ciliary
neurotrophic factor (CNTF), growth factors and growth factor inhibitors,
antiapoptotic factors, such as Bcl-2 (CNTF), antitumor agents, anti-angiogenics,
and genes or portions thereof for gene replacement or repair of defective genes.
Hence, methods for treatment of inherited and acquired retinal diseases,
including diseases involving neovascular and vascular degeneration, are provided.

Methods for treating diseases involving genes expressed in photoreceptor 
cells are provided herein. The methods provided herein are practiced by 
administration of the recombinant viral vectors by any means suitable for 
delivery to the photoreceptors. A preferred mode of administration is intraocular 
injection including intravitreal and subretinal injection. Other modes of 
administration include, but are not limited to, intrascleral, periorbital and 
intravenous administration. The vectors also can include photoreceptor-specific 
promoters thereby providing a means, not only for specific targeting of 
expression in these cells, but also for photoreceptor-restricted transgene 
expression.

DETAILED DESCRIPTION OF THE INVENTION
A. DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as is commonly understood by one of skill in the art to 
which this invention belongs. All patents, applications, published applications 
and other publications and sequences from GenBank and other data bases 
referred to anywhere in the disclosure herein are incorporated by reference in 
their entirety. 

As used herein, the amino acids, which occur in the various amino acid 
sequences appearing herein, are identified according to their three-letter or one- 
letter abbreviations. The nucleotides, which occur in the various DNA 
fragments, are designated with the standard single-letter designations used 
routinely in the art (see, Table 1). 

As used herein, amino acid residue refers to an amino acid formed upon 
chemical digestion (hydrolysis) of a polypeptide at its peptide linkages. The 
amino acid residues described herein are preferably in the "L" isomeric form. 
However, residues in the "D" isomeric form can be substituted for any L-amino 
acid residue, as long as the desired functional property is retained by the 
polypeptide. NH2 refers to the free amino group present at the amino terminus
of a polypeptide. COOH refers to the free carboxy group present at the carboxyl
terminus of a polypeptide. In keeping with standard polypeptide nomenclature
described in J. Biol. Chem., 243:3552-59 (1969) and adopted at 37 C.F.R. §§
1.821-1.822, abbreviations for amino acid residues are shown in the following
Table:

Table 1

SYMBOL
1-Letter    3-Letter    AMINO ACID
Y           Tyr         tyrosine
G           Gly         glycine
F           Phe         phenylalanine
M           Met         methionine
A           Ala         alanine
S           Ser         serine
I           Ile         isoleucine
L           Leu         leucine
T           Thr         threonine
V           Val         valine
P           Pro         proline
K           Lys         lysine
H           His         histidine
Q           Gln         glutamine
E           Glu         glutamic acid
Z           Glx         Glu and/or Gln
W           Trp         tryptophan
R           Arg         arginine
D           Asp         aspartic acid
N           Asn         asparagine
B           Asx         Asn and/or Asp
C           Cys         cysteine
X           Xaa         Unknown or other

It should be noted that all amino acid residue sequences represented
herein by formulae have a left to right orientation in the conventional direction of
amino-terminus to carboxyl-terminus. In addition, the phrase "amino acid
residue" is broadly defined to include the amino acids listed in the Table of
Correspondence and modified and unusual amino acids, such as those referred
to in 37 C.F.R. §§ 1.821-1.822, and incorporated herein by reference.
Furthermore, it should be noted that a dash at the beginning or end of an amino
acid residue sequence indicates a peptide bond to a further sequence of one or
more amino acid residues or to an amino-terminal group such as NH2 or to a
carboxyl-terminal group such as COOH.

In a peptide or protein, suitable conservative substitutions of amino acids
are known to those of skill in this art and may be made generally without altering
the biological activity of the resulting molecule. Those of skill in this art
recognize that, in general, single amino acid substitutions in non-essential
regions of a polypeptide do not substantially alter biological activity (see, e.g.,
Watson et al. Molecular Biology of the Gene, 4th Edition, 1987, The
Benjamin/Cummings Pub. Co., p. 224).
Such substitutions are preferably made in accordance with those set forth
in TABLE 2 as follows:

TABLE 2

Original residue    Conservative substitution
Ala (A)             Gly; Ser
Arg (R)             Lys
Asn (N)             Gln; His
Cys (C)             Ser
Gln (Q)             Asn
Glu (E)             Asp
Gly (G)             Ala; Pro
His (H)             Asn; Gln
Ile (I)             Leu; Val
Leu (L)             Ile; Val
Lys (K)             Arg; Gln; Glu
Met (M)             Leu; Tyr; Ile
Phe (F)             Met; Leu; Tyr
Ser (S)             Thr
Thr (T)             Ser
Trp (W)             Tyr
Tyr (Y)             Trp; Phe
Val (V)             Ile; Leu

Other substitutions are also permissible and may be determined empirically or in 
accord with known conservative substitutions. 
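
As an illustration only, the substitutions listed in TABLE 2 can be encoded as a lookup keyed on three-letter codes. The dictionary and function names below are hypothetical and are not part of the disclosure; the sketch simply reports whether a given replacement appears in the table for a given original residue.

```python
# Conservative substitutions transcribed from TABLE 2 (three-letter codes).
CONSERVATIVE_SUBSTITUTIONS = {
    "Ala": {"Gly", "Ser"},
    "Arg": {"Lys"},
    "Asn": {"Gln", "His"},
    "Cys": {"Ser"},
    "Gln": {"Asn"},
    "Glu": {"Asp"},
    "Gly": {"Ala", "Pro"},
    "His": {"Asn", "Gln"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln", "Glu"},
    "Met": {"Leu", "Tyr", "Ile"},
    "Phe": {"Met", "Leu", "Tyr"},
    "Ser": {"Thr"},
    "Thr": {"Ser"},
    "Trp": {"Tyr"},
    "Tyr": {"Trp", "Phe"},
    "Val": {"Ile", "Leu"},
}


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if TABLE 2 lists `replacement` for `original`."""
    # Residues absent from the table have no listed substitutions.
    return replacement in CONSERVATIVE_SUBSTITUTIONS.get(original, set())


print(is_conservative("Ala", "Gly"))
print(is_conservative("Ala", "Trp"))
```

The table is not symmetric (Tyr is listed for Met, but Met is not listed for Tyr), so the lookup is directional by design.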

As used herein, a complementing plasmid describes plasmid vectors that
deliver nucleic acids into a packaging cell line for stable integration into a
chromosome in the cellular genome.

As used herein, a delivery plasmid is a plasmid vector that carries or
delivers nucleic acids encoding a therapeutic gene or gene that encodes a
therapeutic product or a precursor thereof or a regulatory gene or other factor
that results in a therapeutic effect when delivered in vivo in or into a cell line,
such as, but not limited to a packaging cell line, to propagate therapeutic viral
vectors.

As used herein, a variety of vectors with different requirements are
described. For example, one vector is used to deliver particular nucleic acid
molecules into a packaging cell line for stable integration into a chromosome.
These types of vectors are generally identified herein as complementing
plasmids. A further type of vector described herein carries or delivers nucleic
acid molecules in or into a cell line (e.g., a packaging cell line) for the purpose of
propagating therapeutic viral vectors; hence, these vectors are generally referred
to herein as delivery plasmids. A third "type" of vector described herein is used
to carry nucleic acid molecules encoding therapeutic proteins or polypeptides or
regulatory proteins or are regulatory sequences to specific cells or cell types in a
subject in need of treatment; these vectors are generally identified herein as
therapeutic viral vectors or recombinant adenoviral vectors or viral Ad-derived
vectors and are in the form of a virus particle encapsulating a viral nucleic acid
containing an expression cassette for expressing the therapeutic gene.

As used herein, a DNA or nucleic acid homolog refers to a nucleic acid
that includes a preselected conserved nucleotide sequence, such as a sequence
encoding a therapeutic polypeptide. By the term "substantially homologous" is
meant having at least 80%, preferably at least 90%, most preferably at least
95% homology therewith or a lesser percentage of homology or identity and
conserved biological activity or function.

The terms "homology" and "identity" are often used interchangeably. In
this regard, percent homology or identity may be determined, for example, by
comparing sequence information using a GAP computer program. The GAP
program utilizes the alignment method of Needleman and Wunsch (J. Mol. Biol.
48:443 (1970)), as revised by Smith and Waterman (Adv. Appl. Math. 2:482
(1981)). Briefly, the GAP program defines similarity as the number of aligned
symbols (i.e., nucleotides or amino acids) which are similar, divided by the total
number of symbols in the shorter of the two sequences. The preferred default
parameters for the GAP program may include: (1) a unary comparison matrix
(containing a value of 1 for identities and 0 for non-identities) and the weighted
comparison matrix of Gribskov and Burgess, Nucl. Acids Res. 14:6745 (1986),
as described by Schwartz and Dayhoff, eds., ATLAS OF PROTEIN SEQUENCE
AND STRUCTURE, National Biomedical Research Foundation, pp. 353-358
(1979); (2) a penalty of 3.0 for each gap and an additional 0.10 penalty for each
symbol in each gap; and (3) no penalty for end gaps.
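
The similarity ratio described above can be sketched for two sequences that have already been aligned. This is not the GAP program: the function name, the `-` gap character, and the use of the unary matrix alone (1 for identities, 0 otherwise) are assumptions made for illustration, and no alignment or gap-penalty scoring is performed.

```python
def gap_similarity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Matching aligned symbols divided by the length of the shorter sequence."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Unary comparison: an aligned pair scores 1 only if the symbols are identical.
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap)
    # Sequence lengths are counted without gap characters.
    shorter = min(len(aligned_a.replace(gap, "")), len(aligned_b.replace(gap, "")))
    if shorter == 0:
        raise ValueError("sequences must contain at least one symbol")
    return matches / shorter


# Four identical aligned positions; the shorter sequence has five symbols.
print(gap_similarity("ACGT-A", "ACCTGA"))
```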



Whether any two nucleic acid molecules have nucleotide sequences that
are at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% "identical" can be
determined using known computer algorithms such as the "FASTA" program,
using, for example, the default parameters as in Pearson and Lipman, Proc. Natl.
Acad. Sci. USA 85:2444 (1988). Alternatively, the BLAST function of the
National Center for Biotechnology Information database may be used to
determine identity.

In general, sequences are aligned so that the highest order match is
obtained. "Identity" per se has an art-recognized meaning and can be calculated
using published techniques. (See, e.g.: Computational Molecular Biology, Lesk,
A.M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics
and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993;
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G.,
eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular
Biology, von Heinje, G., Academic Press, 1987; and Sequence Analysis Primer,
Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991).

While there exist a number of methods to measure identity between two
polynucleotides or polypeptide sequences, the term "identity" is well known to
skilled artisans (Carillo, H. & Lipton, D., SIAM J Applied Math 48:1073 (1988)).
Methods commonly employed to determine identity or similarity between two
sequences include, but are not limited to, those disclosed in Guide to Huge
Computers, Martin J. Bishop, ed., Academic Press, San Diego, 1994, and
Carillo, H. & Lipton, D., SIAM J Applied Math 48:1073 (1988). Methods to
determine identity and similarity are codified in computer programs. Preferred
computer program methods to determine identity and similarity between two
sequences include, but are not limited to, GCG program package (Devereux, J.,
et al., Nucleic Acids Research 12(1):387 (1984)), BLASTP, BLASTN, FASTA
(Atschul, S.F., et al., J Molec Biol 215:403 (1990)).
Therefore, as used herein, the term "identity" represents a comparison
between a test and a reference polypeptide or polynucleotide. For example, a
test polypeptide may be defined as any polypeptide that is 90% or more
identical to a reference polypeptide. As used herein, the term at least "90%
identical to" refers to percent identities from 90 to 99.99 relative to the
reference polypeptides. Identity at a level of 90% or more is indicative of the
fact that, assuming for exemplification purposes that a test and a reference
polypeptide with a length of 100 amino acids are compared, no more than 10%
(i.e., 10 out of 100) of the amino acids in the test polypeptide differ from
those of the reference polypeptide. Similar comparisons may be made between
test and reference polynucleotides. Such differences may be represented as
point mutations randomly distributed over the entire length of an amino acid
sequence, or they may be clustered in one or more locations of varying length
up to the maximum allowable, e.g., 10/100 amino acid difference (approximately
90% identity). Differences are defined as nucleic acid or amino acid
substitutions, or deletions.
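
By way of illustration only, the percent identity comparison defined above may
be sketched as follows. This is a minimal sketch and not one of the programs
cited herein (GCG, BLASTP, BLASTN, FASTA): it assumes the test and reference
sequences have already been aligned to equal length, that a "-" character
marks a deletion, and that the function names and the 100-residue example
sequence are hypothetical.

```python
def percent_identity(test: str, reference: str) -> float:
    """Return percent identity of two pre-aligned, equal-length sequences.

    A '-' (assumed gap symbol) marks a deletion and counts as a difference,
    in the same way as a substitution.
    """
    if not reference or len(test) != len(reference):
        raise ValueError("sequences must be pre-aligned, non-empty and equal in length")
    matches = sum(1 for t, r in zip(test, reference) if t == r and t != "-")
    return 100.0 * matches / len(reference)


def is_at_least_identical(test: str, reference: str, threshold: float = 90.0) -> bool:
    """True if the test sequence meets the identity threshold (default 90%)."""
    return percent_identity(test, reference) >= threshold


# Hypothetical 100-residue reference polypeptide.
reference = "ACDEFGHIKL" * 10
# Ten substitutions clustered at one end: the maximum allowable at 90%.
clustered = "W" * 10 + reference[10:]

print(percent_identity(clustered, reference))       # 90.0
print(is_at_least_identical(clustered, reference))  # True
```

The result is the same whether the ten differences are clustered, as in this
example, or distributed over the length of the sequence, since only the number
of differing positions enters the calculation.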

As used herein, genetic therapy involves the transfer of heterologous
DNA to certain cells, target cells, of a mammal, particularly a human, with a
disorder or condition for which such therapy is sought. The DNA is introduced
into the selected target cells in a manner such that the heterologous DNA is
expressed and a therapeutic product encoded thereby is produced.
Alternatively, the heterologous DNA may in some manner mediate expression of
DNA that encodes the therapeutic product, or it may encode a product, such as
a peptide or RNA, that in some manner mediates, directly or indirectly,
expression of a therapeutic product. Genetic therapy may also be used to
deliver nucleic acid encoding a gene product to replace a defective gene or
supplement a gene product produced by the mammal or the cell in which it is
introduced. The introduced nucleic acid may encode a therapeutic compound,
such as a growth factor or inhibitor thereof, or a tumor necrosis factor or
inhibitor thereof, such as a receptor therefor, that is not normally produced
in the mammalian host or that is not produced in therapeutically effective
amounts or at a therapeutically useful time. The heterologous DNA encoding the
therapeutic product may be modified prior to introduction into the cells of
the afflicted host in order to enhance or otherwise alter the product or
expression thereof.

As used herein, heterologous DNA is DNA that encodes RNA and proteins
that are not normally produced in vivo by the cell in which it is expressed or
that mediates or encodes mediators that alter expression of endogenous DNA by
affecting transcription, translation, or other regulatable biochemical
processes. Heterologous DNA may also be referred to as foreign DNA. Any DNA
that one of skill in the art would recognize or consider as heterologous or
foreign to the cell in which it is expressed is herein encompassed by
heterologous DNA. Examples of heterologous DNA include, but are not limited
to, DNA that encodes traceable marker proteins, such as a protein that confers
drug resistance, DNA that encodes therapeutically effective substances, such
as anti-cancer agents, enzymes and hormones, and DNA that encodes other types
of proteins, such as antibodies. Antibodies that are encoded by heterologous
DNA may be secreted or expressed on the surface of the cell in which the
heterologous DNA has been introduced.

Hence, herein heterologous DNA, or foreign DNA, refers to a DNA
molecule not present in the exact orientation and position as the counterpart
DNA molecule found in the corresponding wild-type adenovirus. It may also
refer to a DNA molecule from another organism or species (i.e., exogenous) or
from another Ad serotype.

As used herein, a therapeutically effective product is a product that is
encoded by heterologous DNA and that, upon introduction of the DNA into a
host, is expressed and effectively ameliorates or eliminates the symptoms or
manifestations of an inherited or acquired disease, or cures said disease.

Typically, the desired heterologous DNA is cloned into a plasmid vector
and introduced by routine methods, such as calcium-phosphate mediated DNA
uptake (see, (1981) Somat. Cell. Mol. Genet. 7:603-616) or microinjection,
into producer cells, such as packaging cells. After amplification in producer
cells, the vectors that contain the heterologous DNA are introduced into
selected target cells.

As used herein, an expression or delivery vector refers to any plasmid or
virus into which a foreign or heterologous DNA may be inserted for expression
in a suitable host cell, i.e., the protein or polypeptide encoded by the DNA
is synthesized in the host cell's system. Vectors capable of directing the
expression of DNA segments (genes) encoding one or more proteins are referred
to herein as "expression vectors." Also included are vectors that allow
cloning of cDNA (complementary DNA) from mRNAs produced using reverse
transcriptase.

As used herein, a gene is a nucleic acid molecule whose nucleotide
sequence encodes RNA or polypeptide. A gene can be either RNA or DNA.
Genes may include regions preceding and following the coding region (leader
and trailer) as well as intervening sequences (introns) between individual
coding segments (exons).

As used herein, tropism with reference to an adenovirus refers to the
selective infectivity or binding that is conferred on the particle by the
fiber protein, such as by the C-terminus portion that comprises the knob.

As used herein, isolated with reference to a nucleic acid molecule or
polypeptide or other biomolecule means that the nucleic acid or polypeptide
has been separated from the genetic environment from which the polypeptide or
nucleic acid was obtained. It may also mean altered from the natural state.
For example, a polynucleotide or a polypeptide naturally present in a living
animal is not "isolated," but the same polynucleotide or polypeptide separated
from the coexisting materials of its natural state is "isolated," as the term
is employed herein. Thus, a polypeptide or polynucleotide produced and/or
contained within a recombinant host cell is considered isolated. Also intended
as an "isolated polypeptide" or an "isolated polynucleotide" are polypeptides
or polynucleotides that have been purified, partially or substantially, from a
recombinant host cell or from a native source. For example, a recombinantly
produced version of a compound can be substantially purified by the one-step
method described in Smith and Johnson, Gene 67:31-40 (1988). The terms
isolated and purified are sometimes used interchangeably.

Thus, by "isolated" is meant that the nucleic acid is free of the coding
sequences of those genes that, in the naturally-occurring genome of the
organism (if any), immediately flank the gene encoding the nucleic acid of
interest. Isolated DNA may be single-stranded or double-stranded, and may be
genomic DNA, cDNA, recombinant hybrid DNA, or synthetic DNA. It may be
identical to a native DNA sequence, or may differ from such sequence by the
deletion, addition, or substitution of one or more nucleotides.

Isolated or purified as it refers to preparations made from biological cells
or hosts means any cell extract containing the indicated DNA or protein,
including a crude extract of the DNA or protein of interest. For example, in
the case of a protein, a purified preparation can be obtained following an
individual technique or a series of preparative or biochemical techniques, and
the DNA or protein of interest can be present at various degrees of purity in
these preparations. The procedures may include, for example, but are not
limited to, ammonium sulfate fractionation, gel filtration, ion exchange
chromatography, affinity chromatography, density gradient centrifugation and
electrophoresis.

A preparation of DNA or protein that is "substantially pure" or "isolated"
should be understood to mean a preparation free from naturally occurring
materials with which such DNA or protein is normally associated in nature.
"Essentially pure" should be understood to mean a "highly" purified
preparation that contains at least 95% of the DNA or protein of interest.

A cell extract that contains the DNA or protein of interest should be
understood to mean a homogenate preparation or cell-free preparation obtained
from cells that express the protein or contain the DNA of interest. The term
"cell extract" is intended to include culture media, especially spent culture
media from which the cells have been removed.

As used herein, a packaging cell line is a cell line that provides a missing
gene product or its equivalent.

As used herein, an adenovirus viral particle is the minimal structural or
functional unit of a virus. A virus can refer to a single particle, a stock of
particles or a viral genome. The adenovirus (Ad) particle is relatively
complex and may be resolved into various substructures.

As used herein, "penton" or "penton complex" are preferentially used
to designate a complex of penton base and fiber. The term "penton" may
also be used to indicate penton base, as well as penton complex. The meaning
of the term "penton" alone should be clear from the context within which it is
used.

As used herein, a plasmid refers to an autonomous self-replicating 
extrachromosomal circular nucleic acid molecule, typically DNA. 

As used herein, a post-transcription regulatory element (PRE) is a
regulatory element found in viral or cellular messenger RNA that is not
spliced, i.e., intronless messages. Examples include, but are not limited to,
human hepatitis virus, woodchuck hepatitis virus, the TK gene and mouse
histone gene. The PRE may be placed before a polyA sequence and after a
heterologous DNA sequence.

As used herein, pseudotyping describes the production of adenoviral
vectors having modified capsid protein or capsid proteins from a different
serotype than the serotype of the vector itself. One example is the production
of an adenovirus 5 vector particle containing an Ad37 fiber protein. This may
be accomplished by producing the adenoviral vector in packaging cell lines
expressing different fiber proteins.

As used herein, promoters of interest herein may be inducible or
constitutive. Inducible promoters will initiate transcription only in the
presence of an additional molecule; constitutive promoters do not require the
presence of any additional molecule to regulate gene expression. A regulatable
or inducible promoter may also be described as a promoter where the rate or
extent of RNA polymerase binding and initiation is modulated by external
stimuli. Such stimuli include, but are not limited to, various compounds or
compositions, light, heat, stress and chemical energy sources. Inducible,
suppressible and repressible promoters are considered regulatable promoters.
Preferred promoters herein are promoters that are selectively expressed in
ocular cells, particularly photoreceptor cells.

As used herein, receptor refers to a biologically active molecule that
specifically or selectively binds to (or with) other molecules. The term
"receptor protein" may be used to more specifically indicate the proteinaceous
nature of a specific receptor.

As used herein, recombinant refers to any progeny formed as the result of
genetic engineering. This may also be used to describe a virus formed by
recombination of plasmids in a packaging cell.

As used herein, a transgene or therapeutic nucleic acid molecule includes
DNA and RNA molecules encoding an RNA or polypeptide. Such molecules may
be "native" or naturally-derived sequences; they may also be "non-native" or
"foreign" sequences that are naturally- or recombinantly-derived. The term
"transgene," which may be used interchangeably herein with the term
"therapeutic nucleic acid molecule," is often used to describe a heterologous
or foreign (exogenous) gene that is carried by a viral vector and transduced
into a host cell.

Therefore, therapeutic nucleic acid molecules include antisense
sequences or nucleotide sequences which may be transcribed into antisense
sequences. Therapeutic nucleotide sequences (or transgenes) all include
nucleic acid molecules that function to produce a desired effect in the cell
or cell nucleus into which said therapeutic sequences are delivered. For
example, a therapeutic nucleic acid molecule can include a sequence of
nucleotides that encodes a functional protein intended for delivery into a
cell which is unable to produce that functional protein.

As used herein, the vitreous of the eye refers to material that fills the 
chamber behind the lens of the eye (i.e., vitreous humor or vitreous body). 

As used herein, a promoter region refers to the portion of DNA of a gene
that controls transcription of the DNA to which it is operatively linked. The
promoter region includes specific sequences of DNA that are sufficient for RNA
polymerase recognition, binding and transcription initiation. This portion of
the promoter region is referred to as the promoter. In addition, the promoter
region includes sequences that modulate this recognition, binding and
transcription initiation activity of the RNA polymerase. These sequences may
be cis acting or may be responsive to trans acting factors. Promoters,
depending upon the nature of the regulation, may be constitutive or regulated.

Thus, promoters are nucleic acid fragments that contain a DNA sequence
that controls the expression of a gene located 3' or downstream of the
promoter. The promoter is the DNA sequence to which RNA polymerase
specifically binds and initiates RNA synthesis (transcription) of that gene,
typically located 3' of the promoter. A promoter also includes DNA sequences
that direct the initiation of transcription, including those to which RNA
polymerase specifically binds. If more than one nucleic acid sequence encoding
a particular polypeptide or protein is included in a therapeutic viral vector
or nucleotide sequence, more than one promoter or enhancer element may be
included, particularly if that would enhance efficiency of expression.

A regulatable or inducible promoter may be described as a promoter
wherein the rate of RNA polymerase binding and initiation is modulated by
external stimuli (see, e.g., U.S. Patent Nos. 5,750,396 and 5,998,205). Such
stimuli include various compounds or compositions, light, heat, stress,
chemical energy sources, and the like. Inducible, suppressible and repressible
promoters are considered regulatable promoters.

Regulatable promoters may also include tissue-specific promoters.
Tissue-specific promoters direct the expression of the gene to which they are
operably linked to a specific cell type. A tissue-specific promoter causes the
gene located 3' of it to be expressed predominantly, if not exclusively, in
the specific cells where the promoter expresses its endogenous gene.
Typically, it appears that if a tissue-specific promoter expresses the gene
located 3' of it at all, then it is expressed appropriately in the correct
cell types (see, e.g., Palmiter et al. (1986) Ann. Rev. Genet. 20:465-499).

As used herein, the phrase "operatively linked" generally means the
sequences or segments have been covalently joined into one piece of DNA,
whether in single or double stranded form, whereby control sequences on one
segment control expression or replication or other such control of other
segments. The two segments are not necessarily contiguous.

As used herein, exogenous encompasses any therapeutic composition
that is administered by the therapeutic methods provided herein. Thus,
exogenous may also be referred to herein as foreign, or non-native or other
equivalent expression.

B. Ad37 fiber tropism

The adenovirus fiber protein is a major determinant of adenovirus tropism
(Gall et al. (1996) J. Virol. 70:2116-2123; Stevenson et al. (1995) J. Virol.
69:2850-2857). The fiber protein extends from the capsid and mediates viral
binding to the cell surface by binding to specific cell receptors (Philipson
et al. (1968) J. Virol. 2:1064-1075). The fiber is a trimeric protein that
includes an N-terminal tail domain that interacts with the adenovirus penton
base, a central shaft domain of varying length, and a C-terminal knob domain
that contains the cell receptor binding site (Chroboczek et al. (1995) Curr.
Top. Microbiol. Immunol. 199:163-200; Ruigrok et al. (1990) J. Mol. Biol.
215:589-596; Stevenson et al. (1995) J. Virol. 69:2850-2857). Fiber proteins
of most adenovirus subgroups have been shown to bind specifically or
selectively to the 46 kDa coxsackievirus-adenovirus receptor (CAR) (Bergelson
et al. (1997) Science 275:1320-1323; Roelvink et al. (1998) J. Virol.
72:7909-7915). CAR appears to be expressed in a variety of human tissues,
including the lung, at various levels (Bergelson et al. (1997) Science
275:1320-1323), but Ad37 binds poorly to lung epithelial cells (Huang et al.
(1999) J. Virol. 73:2798-2802). This suggests that the tropism of this
serotype may be influenced by factors independent of CAR expression.

Structural and biochemical data also suggest that distinct receptor binding
sites are located on different regions of the Ad5 and Ad37 fiber knobs.
Adopting the nomenclature of Xia et al. (Xia et al. (1994) Structure
2:1259-1270), the receptor binding site for Ad5 is located at the AB-loop on
the side of the fiber knob (Bewley et al. (1999) Science 286:1579-1583;
Roelvink et al. (1999) Science 286:1568-1571). It is known that a lysine
residue at position 240 of the Ad37 fiber, located in the CD-loop, is
important for receptor binding (Huang et al. (1999) J. Virol. 73:2798-2802).
The co-crystal structure of the Ad12 knob and the N-terminal domain of CAR
(Bewley et al. (1999) Science 286:1579-1583) shows that the CD-loop does not
contact CAR. It thus appears that different regions of the Ad5 and Ad37 fiber
knobs recognize distinct cell receptors.

A 46 kDa receptor for coxsackieviruses and adenoviruses (CAR) mediates
attachment for many adenovirus serotypes. The wide distribution of CAR fails
to explain why certain adenovirus serotypes (e.g., Ad37) are highly associated
with severe ocular infections such as epidemic keratoconjunctivitis (EKC).
Ad37 does not use CAR, but instead uses a glycoprotein that contains sialic
acid as its primary receptor (Arnberg et al. (2000) J. Virol. 74:42-48). The
modest number of Ad37 binding sites per cell (Huang et al. (1999) J. Virol.
73:2798-2802) also suggests that Ad37 recognizes a specific glycoprotein as
its primary receptor for binding to conjunctival cells.

Adenovirus type 37 (subgroup D) has been associated with infections of
the eye and genital tract. The tropism of Ad37 derives from the binding
preference of its fiber protein, which binds to a receptor located on the
surface of cells including the Chang C conjunctival epithelial cell line
(Huang et al. (1999) J. Virol. 73:2798-2802).

A protein receptor that is preferentially expressed on conjunctival cells and
to which Ad37 fiber binds is shown herein. The preferential expression of the
Ad37 receptor protein on conjunctival cells suggests that this receptor likely
influences Ad37 tropism and should play a key role in ocular pathogenesis. It
is shown herein that Ad37 uses a distinct protein receptor that is selectively
expressed on conjunctival cells. It is shown that Ad37 binds well to
conjunctival cells (Chang C), but poorly to lung carcinoma cells (A549). To
determine if infection correlated with cell binding, an Ad5 vector containing
the Ad37 fiber protein was constructed. The 'pseudotyped' vector delivered
transgenes to Chang C cells better than to A549 cells. Ad37 binding was
abolished by protease treatment of Chang C cells, indicating the receptor is a
membrane protein. Ad37 binding to conjunctival cells is shown herein to be
calcium-dependent. It is also shown that Ad37 infection was not inhibited by a
function-blocking anti-CAR monoclonal antibody, which is a feature distinct
from Ad5 fiber interaction with CAR. Using a virus overlay protein blot assay
(VOPBA), calcium-dependent Ad37 binding to a 50 kDa membrane protein on Chang
C cells, but not A549 cells, was detected. Ad19p, a closely related serotype
that fails to bind to conjunctival cells, does not recognize the 50 kDa
protein. Together, these data indicate that the 50 kDa protein is a candidate
receptor for Ad37 on conjunctival cells.

Significantly, it is also shown herein that, upon administration of the
vector to the vitreous humor, the recombinant adenovirus with the Ad37 fiber
preferentially and selectively binds to photoreceptor cells. Hence, a
recombinant adenoviral delivery vehicle that has an Ad37 fiber protein can
serve as a vector for delivery of therapeutic agents to the eye for treatment
of ocular disorders, including genetic and acquired disorders. The
identification of the receptor for Ad37 and the resulting recognition of Ad37
tropism allows targeting of adenovirus vectors to specific human ocular cells.

As noted, fiber plays a crucial role in adenovirus infection by attaching
the virus to a specific receptor on a cell surface. Hexon, penton and fiber
capsomeres are the major components on the surface of the virion. The fiber is
an elongated protein which exists as a trimer of three identical polypeptides
(polypeptide IV) of 582 amino acids in length. An adenovirus fiber includes
three domains: an N-terminal tail domain that interacts with penton base; a
shaft composed of variable numbers of repeats of a 15-amino-acid segment that
forms beta-sheet and beta-bends; and a knob at the C-terminus ("head domain")
that contains the type-specific antigen and is responsible for binding to the
cell surface receptor. The gene encoding the fiber protein from Ad2 has been
expressed in human cells and has been shown to be correctly assembled into
trimers, glycosylated and transported to the nucleus (see, e.g., Hong and
Engler, Virology 185:758-761, 1991). Thus, alteration of the fiber in
recombinant Ad vectors can lead to alteration in gene delivery.

As shown herein, alteration of fiber in recombinant Ad vectors such that
the fiber is derived from Ad37 or another adenovirus serotype D provides a
means for selective delivery of a recombinant virus to particular cells in the
eye, including conjunctival cells, and most significantly photoreceptors,
thereby providing a means for targeted delivery to photoreceptor cells.

Photoreceptor cells are implicated in a number of hereditary and acquired
retinal degenerative disorders. In addition, photoreceptor cells are located
such that products produced therein can be delivered to other areas of the eye
by virtue of the blood flow in the vicinity of the photoreceptor cells and
also by virtue of the proximity of the photoreceptors to the retinal pigmented
epithelium (RPE) and other retinal cells.

Hence it is contemplated herein that the recombinant viral vector will
include a packaged recombinant adenovirus genome containing at least the
minimal elements for replication and packaging; heterologous DNA encoding a
desired gene product, typically a therapeutic product or plurality of
products, such as several trophic factors, whose combined activity is
effective for treating a disorder, such as a retinal degenerative disorder;
and the resulting virion particles will include a fiber that has a sufficient
portion to confer specific targeting to photoreceptor cells when the
recombinant viral particles are introduced into the aqueous humor of a
mammalian, preferably a human, eye, or otherwise contacted with the
photoreceptor cells. The fiber may be a chimeric protein that has been
modified for effective interaction with other coat structural proteins, such
as penton. In addition, the fiber may be modified to include other elements
that alter its tropism to permit binding to other cells as well (see, e.g.,
U.S. Patent Nos. 5,756,086 and 5,543,328, International PCT application Nos.
WO 95/26412 and WO 98/44121 and Krasnykh et al., J. Virol. 70:6839-46, 1996).

C. Construction of the viral particles 

1. Selection of viral genome and fiber protein

Methods for preparing recombinant adenoviral vectors for gene
product delivery are well known. Preferred among those are the methods
exemplified herein (see EXAMPLES) and also described in copending U.S.
application Serial No. 09/482,682 (also filed as International PCT application
No. PCT/US00/00265, filed January 14, 2000, which claims priority to U.S.
provisional application Serial No. 60/115,920, as does U.S. application Serial
No. 09/482,682).

As noted, any desired recombinant adenovirus is contemplated for use in
the methods herein as long as the viral genome is packaged in a capsid that
includes at least the portion of a fiber protein that provides selective
binding to photoreceptor cells. This fiber protein is preferably from an
adenovirus type D serotype and is preferably an Ad37 fiber. The fiber protein
should retain the knob region at the C-terminus ("head domain") from the Ad
virus of subgroup D that contains the type-specific antigen and is responsible
for binding to the cell surface receptor. Hence the fiber protein can be a
chimeric fiber protein as long as it retains a sufficient portion of the type
D serotype to specifically or selectively bind to photoreceptor cells.
Generally the portion retained will be all or a portion of the knob region.
The precise amount of knob region required can be determined empirically by
including portions thereof and identifying the minimum residues from an Ad
type D serotype, preferably Ad37, to effect selective targeting of a virion
packaged with such fiber to photoreceptors in the eye upon introduction of the
packaged virion into the aqueous humor.

Recombinant adenovirus containing heterologous nucleic acids that
encode a desired product, such as a gene to correct a genetic defect, may be
made by any methods known to those of skill in the art. The viruses must be
packaged in a cell line that results in expression of fiber on the particles
that specifically, selectively or preferentially targets (binds and results in
internalization) the viral particle to cells in the eye. The fiber protein
from Ad37 and other adenoviruses of serotype D that infect the eye effects
such targeting. The resulting adenovirus particles that express such fiber are
administered by intraocular injection, subretinal injection, particularly
intravitreal injection, or any means that results in preferential accumulation
in photoreceptor cells.

The family of Adenoviridae includes many members with at least 47
known serotypes of human adenovirus (Ad1-Ad47) (Shenk, Virology, Chapter
67, in Fields et al., eds. Lippincott-Raven, Philadelphia, 1996) as well as
members of the genus Mastadenovirus including human, simian, bovine, equine,
porcine, ovine, canine and opossum viruses and members of the Aviadenovirus
genus, including bird viruses, such as CELO.

Thus it is contemplated that the methods herein can be applied to any
recombinant viral vectors derived from any adenovirus species. One of skill in
the art would have knowledge of the different adenoviruses (see, e.g., Shenk,
Virology, Chapter 67, in Fields et al., eds. Lippincott-Raven, Philadelphia,
1996) and can construct recombinant viruses containing portions of the genome
of any such virus.

In the exemplified embodiment, viral particles with Ad37 fiber were
prepared. Site-directed mutations were made to the Ad37 fiber gene to make
the tail sequence more closely match that of Ad5 to facilitate Ad37 fiber
binding to the Ad5 penton base. The plasmid for the expression of the Ad37
fiber protein, pDV80, contains the CMV promoter, the adenovirus type 5
tripartite leader (TPL), and the modified Ad37 fiber gene sequence. Genes of
interest, such as nucleic acid encoding the β subunit of cGMP
phosphodiesterase (βPDE), β-glucuronidase, rhodopsin, growth factors,
anti-cancer agents, growth factor receptors and other anti-angiogenic agents,
and anti-apoptotic agents, can be incorporated into these vectors using the
methods known to those of skill in the art and exemplified herein.

Known adenovirus vectors, previously constructed for intraocular therapy
(see, e.g., Bennett et al. (1996) Nature Medicine 2:649-654, which provides an
Ad virus encoding βPDE for treatment of retinitis pigmentosa; Cayouette et al.
(1998) Human Gene Therapy 5:423-430, which provides an Ad vector that
expresses CNTF for treatment of retinitis pigmentosa and other retinal
degenerative diseases; and Li et al. (1995) Proc. Natl. Acad. Sci. U.S.A.
92:7700-7704, which provides an Ad virus vector that encodes a human
β-glucuronidase for treatment of lysosomal storage disease caused by
β-glucuronidase deficiency) can be modified by repackaging the recombinant
genome using a packaging line that expresses an Ad37 fiber or other D serotype
fiber.

For exemplification, nucleic acid encoding GFP was incorporated into
these vectors as a means to visualize their localization. Other genes, such as
genes that encode therapeutic products, may be included in place of or in
addition to GFP.

Plasmid pDV80 was electroporated into E1-2a S8 cells and stable lines
were selected. The fiber-deleted vectors Ad5.βgal.ΔF and Ad5.GFP.ΔF were
grown in cells of a resulting cell line, designated 705, to produce virions
that express the Ad37 fiber (Ad5.βgal.ΔF/37F and Ad5.GFP.ΔF/37F), which were
then CsCl-purified. These virions selectively transduce photoreceptor cells
when injected intraocularly into the vitreous humor.

2. Packaging

Recombinant adenoviral vectors generally have at least a deletion in the
first viral early gene region, referred to as E1, which includes the E1a and
E1b regions. Deletion of the viral E1 region renders the recombinant
adenovirus defective for replication and incapable of producing infectious
viral particles in subsequently-infected target cells. Thus, replication of an
E1-deleted adenovirus genome and production of virus particles require a
system of complementation which provides the missing E1 gene product. E1
complementation is typically provided by a cell line expressing E1, such as
the human embryonic kidney packaging cell line, i.e., an epithelial cell line,
called 293. Cell line 293 contains the E1 region of adenovirus, which provides
E1 gene region products to "support" the growth of E1-deleted virus in the
cell line (see, e.g., Graham et al., J. Gen. Virol. 36:59-71, 1977).
Additionally, cell lines that may be usable for production of defective
adenovirus having a portion of the adenovirus E4 region have been reported
(WO 96/22378).

Multiply deficient adenoviral vectors and complementing cell lines have 
also been described (WO 95/34671, U.S. Patent No. 5,994,106). 

Copending U.S. application Serial No. 09/482,682 (also filed as
International PCT application No. PCT/US00/00265, filed January 14, 2000)
provides packaging cell lines that support viral vectors with deletions of
major portions of the viral genome, without the need for helper viruses, and
also provides cell lines and helper viruses for use with helper-dependent
vectors. The packaging cell line has heterologous DNA stably integrated into
the chromosomes of the cellular genome. The heterologous DNA sequence encodes
one or more adenovirus regulatory and/or structural polypeptides that
complement the genes deleted or mutated in the adenovirus vector genome to
be replicated and packaged. The packaging cell line expresses, for example,
one or more adenovirus structural proteins, polypeptides, or fragments
thereof, such as penton base, hexon, fiber, polypeptide IIIa, polypeptide V,
polypeptide VI, polypeptide VII, polypeptide VIII, and biologically active
fragments thereof. The expression can be constitutive or under the control of
a regulatable promoter. These cell lines are designed for expression of
recombinant adenoviruses intended for delivery of therapeutic products.

Particular packaging cell lines complement viral vectors having a deletion 
or mutation of a DNA sequence encoding an adenovirus structural protein, 
regulatory polypeptides E1A and E1B, and/or one or more of the following 
regulatory proteins or polypeptides: E2A, E2B, E3, E4, L4, or fragments thereof.

The packaging cell lines are produced by introducing each DNA molecule
into the cells and then into the genome via a separate complementing plasmid;
alternatively, a plurality of DNA molecules encoding the complementing proteins
can be introduced via a single complementing plasmid. Of interest herein is a
variation in which the complementing plasmid includes DNA encoding an
adenovirus fiber protein (or a chimeric or modified variant thereof) from an Ad
virus of subgroup D, such as Ad37, or a polypeptide or fragment thereof.

For therapeutic applications, the delivery plasmid further includes a
nucleotide sequence encoding a foreign polypeptide. Exemplary delivery
plasmids include, but are not limited to, pDV44, pΔE1Bβ-gal and pΔE1sp1B.
Therapeutic genes may be introduced in a similar or analogous manner.

The cell further includes a complementing plasmid encoding a fiber as 
contemplated herein; the plasmid or portion thereof is integrated into a 
chromosome(s) of the cellular genome of the cell. 

In one embodiment, a composition comprises a cell containing first and
second delivery plasmids, wherein the first delivery plasmid comprises an
adenovirus genome lacking a nucleotide sequence encoding fiber and incapable
of directing the packaging of new viral particles in the absence of the second
delivery plasmid, and the second delivery plasmid comprises an adenoviral genome
capable of directing the packaging of new viral particles in the presence of the
first delivery plasmid.

In a variation, the packaging cell line expresses fiber protein or a chimeric
variant thereof from an Ad virus of subgroup D, preferably the Ad37 serotype, or
it can express any fiber protein that has been modified to include the portion of
the Ad virus of subgroup D, such as Ad37, that is responsible for selective
targeting to photoreceptors upon introduction into the vitreous humor of the eye
of a mammal, preferably a human. The fiber protein can be further modified to
include a non-native amino acid residue sequence that targets additional specific
receptors. In all instances, the modification should not disrupt trimer formation
or transport of fiber into the nucleus. In another variation, the non-native amino
acid residue sequence alters the binding specificity of the fiber for a targeted cell
type. The fiber structural protein can include amino acid residue sequences
from more than one adenovirus serotype. The nucleotide sequences encoding
fiber protein or polypeptide need not be modified solely at one or both termini;
fiber protein may be modified "internally" as well as at the termini.

Additional nucleic acid fragments can encode polypeptides that are added
to the fiber protein. In one variation, the non-native amino acid residue
sequence is coupled to the carboxyl terminus of the fiber. In another, the
non-native amino acid residue sequence further includes a linker sequence.
Alternatively, the fiber protein further comprises a ligand coupled to the linker.
Suitable ligands include, but are not limited to, ligands that specifically or
selectively bind to a cell surface receptor and ligands that can be used to couple
other proteins or nucleic acid molecules. Typically, the packaging cell lines will
contain nucleic acid encoding the fiber protein or modified protein stably
integrated into a chromosome or chromosomes in the cellular genome.

The packaging cell line can be derived from a procaryotic cell line or from
a eukaryotic cell line. While various embodiments suggest the use of mammalian
cells, and more particularly, epithelial cell lines, a variety of other, non-epithelial
cell lines can be used in various embodiments. Thus, while various embodiments
disclose the use of a cell line selected from among the 293, A549, W162, HeLa,
Vero, 211, and 211A cell lines, any other cell lines suitable for such use are
likewise contemplated herein.

3. Components of the nucleic acid molecule included in the particle

A recombinant viral vector or therapeutic viral vector for use in the
methods herein typically includes a nucleic acid fragment that encodes a
protein or polypeptide molecule, or a biologically active fragment thereof, or
other regulatory sequence, that is intended for use in therapeutic applications.

The nucleic acid molecule to be packaged in the viral particle also may
include an enhancer element and/or a promoter located 3' or 5' to and
controlling the expression of the therapeutic product-encoding nucleic acid
molecule if the product is a protein. Further, for purposes herein, the promoter
and/or other transcriptional and translational regulatory sequences controlling
expression of the product is preferably one that is expressed specifically in the
targeted cells, such as a photoreceptor-specific promoter, for example a
rhodopsin gene promoter.

The nucleic acid molecule to be packaged in the viral capsid includes at least
2 different operatively linked DNA segments. The DNA can be manipulated and
amplified by PCR as described herein and by using standard techniques, such as
those described in Molecular Cloning: A Laboratory Manual, 2nd Ed., Sambrook
et al., eds., Cold Spring Harbor, New York (1989). Typically, to produce such a
molecule, the sequence encoding the selected polypeptide and the promoter or
enhancer are operatively linked to a DNA molecule capable of autonomous
replication in a cell either in vivo or in vitro. By operatively linking the enhancer
element or promoter and nucleic acid molecule to the vector, the attached
segments are replicated along with the vector sequences.

Thus, the recombinant DNA molecule (rDNA) is a hybrid DNA molecule
comprising at least 2 nucleotide sequences not normally found together in
nature. In various preferred embodiments, one of the sequences is a sequence
encoding an Ad-derived polypeptide, protein, or fragment thereof. The nucleic
acid molecule intended to be packaged is from about 20 base pairs to about
40,000 base pairs in length, preferably about 50 bp to about 38,000 bp in
length. In various embodiments, the nucleic acid molecule is of sufficient length
to encode one or more adenovirus proteins or functional polypeptide portions
thereof. Since individual Ad polypeptides vary in length from about 19 amino
acid residues to about 967 amino acid residues, encoding nucleic acid molecules
range from about 50 bp up to about 3000 bp, depending on the number and size of
individual polypeptide-encoding sequences that are "replaced" in the viral
vectors by therapeutic product-encoding nucleic acid molecules.
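
The correspondence between the stated polypeptide lengths (about 19 to about 967 residues) and the stated coding lengths (about 50 bp to about 3000 bp) follows from three nucleotides per codon. The following is a minimal Python sketch of that arithmetic only; the function name `coding_length_bp` and the optional stop-codon flag are illustrative assumptions and are not part of the disclosure.

```python
CODON_LENGTH = 3  # nucleotides per amino acid residue


def coding_length_bp(residues: int, include_stop: bool = False) -> int:
    """Return the base pairs needed to encode `residues` amino acid residues.

    `include_stop` (an illustrative option) adds one codon for a stop signal.
    """
    if residues < 0:
        raise ValueError("residues must be non-negative")
    codons = residues + (1 if include_stop else 0)
    return codons * CODON_LENGTH


# Residue counts quoted in the text for the shortest and longest Ad polypeptides.
print(coding_length_bp(19))   # 57, i.e. about 50 bp
print(coding_length_bp(967))  # 2901, i.e. about 3000 bp
```

The computed values of 57 bp and 2901 bp are consistent with the approximate range given above.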

Preferably the molecule includes an adenovirus tripartite leader (TPL)
nucleic acid sequence operatively linked to an intron containing RNA processing
signals (such as, for example, splice donor or splice acceptor sites) suitable for
expression in the packaging cell line. Most preferably the intron contains a splice
donor site and a splice acceptor site. Alternatively, the TPL nucleotide sequence
may not comprise an intron. The intron includes any sequence of
nucleotides that functions in the packaging cell line to provide RNA processing
signals, including splicing signals. Introns have been well characterized from a
large number of structural genes, and include but are not limited to a native
intron 1 from adenovirus, such as Ad5's TPL intron 1; others include the SV40
VP intron, the rabbit beta-globin intron, and synthetic intron constructs (see,
e.g., Petitclerc et al. (1995) J. Biotechnol., 40:169; and Choi et al. (1991) Mol.
Cell. Biol., 11:3070).

The nucleic acid molecule encoding the TPL includes either (a) first and
second TPL exons or (b) first, second and third TPL exons, where each TPL exon
in the sequence is selected from among the complete TPL exon 1, partial TPL
exon 1, complete TPL exon 2 and complete TPL exon 3. A complete exon is
one which contains the complete nucleic acid sequence based on the sequence
found in the wild type viral genome. Preferably the TPL exons are from Ad2,
Ad3, Ad5, Ad7 and the like; however, they may come from any Ad serotype, as
described herein. A preferred partial TPL exon 1 is described in the Examples.
The use of a TPL with a partial exon 1 has been reported (International PCT
application No. WO 98/13499).

The intron and the TPL exons can be operatively linked in a variety of
configurations to provide a functional TPL nucleotide sequence. An intron need
not be a part of the construct. When present, the intron can be positioned
between any of TPL exons 1, 2 or 3, and the exons can be in any order of first
and second, or first/second/third. The intron can also be placed preceding the
first TPL exon or following the last TPL exon. In a preferred embodiment,
complete TPL exon 1 is operatively linked to complete TPL exon 2 operatively
linked to complete TPL exon 3. In a preferred variation, adenovirus TPL intron 1
is positioned between complete TPL exon 1 and complete TPL exon 2. It may
also be possible to use analogous translational regulators from other viral
systems such as rabies virus.
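
The intron placements described above (preceding the first exon, between adjacent exons, or following the last exon) can be listed mechanically for a given exon order. The following Python sketch is offered only as an illustration; the function name `intron_placements` and the labels "exon1", "exon2", "exon3" and "intron" are hypothetical placeholders, not sequences or constructs from the Examples.

```python
from typing import List


def intron_placements(exons: List[str], intron: str = "intron") -> List[List[str]]:
    """List every layout with a single intron before, between, or after the exons."""
    layouts = []
    for position in range(len(exons) + 1):
        # Insert the intron at `position` without modifying the input list.
        layouts.append(exons[:position] + [intron] + exons[position:])
    return layouts


# Three complete TPL exons in their native order (placeholder labels).
for layout in intron_placements(["exon1", "exon2", "exon3"]):
    print(" - ".join(layout))
```

Under these assumptions, the second layout listed (intron between exon 1 and exon 2) corresponds to the preferred variation described above.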

A preferred "complete" TPL nucleic acid molecule containing complete
TPL exons 1, 2 and 3 with adenovirus intron 1 inserted between exons 1 and 2
has a nucleotide sequence shown in SEQ ID NO: 32. A preferred "partial" TPL
nucleic acid molecule containing partial TPL exon 1 and complete TPL exons 2
and 3 in that order has a nucleotide sequence shown in SEQ ID NO: 26. The
construction of these preferred TPL nucleotide sequences is described in the
Examples.

Thus, preferred expression cassettes and complementing plasmids for
expressing adenovirus structural genes, particularly fiber protein, contain an
adenovirus TPL nucleotide sequence as described herein.

4. Complementing Plasmids

Also contemplated is the use of nucleic acid molecules, typically in the
form of DNA plasmid vectors, which are capable of expression of an adenovirus
structural protein or regulatory protein. Because these expression plasmids are
used to complement the defective genes of a recombinant adenovirus vector
genome, the plasmids are referred to as complementing or complementation
plasmids.

The complementing plasmid contains an expression cassette, a nucleotide
sequence capable of expressing a protein product encoded by the nucleic acid
molecule. Expression cassettes typically contain a promoter and a structural
gene operatively linked to the promoter. The complementing plasmid can further
include a TPL nucleotide sequence to enhance
expression of the structural gene product when used in the context of
adenovirus genome replication and packaging.

A complementing plasmid can include a promoter operatively linked to a
sequence of nucleotides encoding an adenovirus structural polypeptide, such as,
but not limited to, penton base; hexon; fiber; polypeptide IIIa; polypeptide V;
polypeptide VI; polypeptide VII; polypeptide VIII; and biologically active
fragments thereof. In another variation, a complementing plasmid may also
include a sequence of nucleotides encoding a first adenovirus regulatory
polypeptide, a second regulatory polypeptide, and/or a third regulatory
polypeptide, and any combination of the foregoing.

Plasmid pDV80 is a preferred plasmid herein. Other plasmids constructed
in an analogous manner to encode modified fiber proteins and chimeric fiber 
proteins are also contemplated herein. 

5. Nucleic Acid Molecule Synthesis 

A nucleic acid molecule comprising synthetic oligonucleotides can be
prepared using any suitable method, such as the phosphotriester or
phosphodiester methods (see, e.g., Narang et al. (1979) Meth. Enzymol.,
68:90; U.S. Patent No. 4,356,270; and Brown et al. (1979) Meth. Enzymol.,
68:109). For a family of oligonucleotides, synthesis of the family members can
be conducted simultaneously in a single reaction vessel, or the members can be
synthesized independently and later admixed in preselected molar ratios. For
simultaneous synthesis, the nucleotide residues that are conserved at preselected
positions of the sequence of the family member can be introduced in a chemical
synthesis protocol simultaneously to the variants by the addition of a single
preselected nucleotide precursor to the solid phase oligonucleotide reaction
admixture when that position number of the oligonucleotide is being chemically
added to the growing oligonucleotide polymer. Nucleotide residues at those
positions in the sequence that vary can be introduced simultaneously by the
addition of amounts, preferably equimolar amounts, of multiple preselected
nucleotide precursors to the solid phase oligonucleotide reaction admixture
during chemical synthesis. For example, where all four possible natural
nucleotides (A, T, G and C) are to be added at a preselected position, their
precursors are added to the oligonucleotide synthesis reaction at that step to
simultaneously form four variants (see, e.g., Ausubel et al., Current Protocols in
Molecular Biology, Suppl. 8, p. 2.11.7, John Wiley & Sons, Inc., New York,
1991).

Nucleotide bases other than the common four nucleotides (A, T, G or C), or
the RNA equivalent nucleotide uracil (U), can also be used. For example, it is
well known that inosine (I) is capable of hybridizing with A, T and G, but not C.
Examples of other useful nucleotide analogs are known in the art and may be
found referred to in 37 C.F.R. §1.822.

Thus, where all four common nucleotides are to occupy a single position
of a family of oligonucleotides, that is, where the preselected nucleotide
sequence is designed to contain oligonucleotides that can hybridize to four
sequences that vary at one position, several different oligonucleotide structures
are contemplated. The composition can contain four members, where a
preselected position contains A, T, G or C. Alternatively, a composition can
contain two nucleotide sequence members, where a preselected position
contains I or C, and has the capacity to hybridize at that position to all four
possible common nucleotides. Finally, other nucleotides may be included at the
preselected position that have the capacity to hybridize in a non-destabilizing
manner with more than one of the common nucleotides in a manner similar to
inosine.
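
The four-member composition described above, in which a preselected position contains A, T, G or C while the remaining positions are conserved, can be enumerated directly. The following Python sketch is illustrative only. The template "ATGNCC" is a hypothetical sequence, the use of "N" to mark the variable position is an assumed notation, and the function name `expand_family` is not drawn from the disclosure. Inosine-containing members are not modeled.

```python
from itertools import product

NATURAL_BASES = "ATGC"  # the four common nucleotides named in the text


def expand_family(template, variable="N"):
    """Return every oligonucleotide encoded by a template.

    Each occurrence of `variable` marks a preselected position that receives
    all four natural bases; every other character is a conserved position.
    """
    choices = [NATURAL_BASES if base == variable else base for base in template]
    return ["".join(combo) for combo in product(*choices)]


# Hypothetical six-base template with one variable position.
family = expand_family("ATGNCC")
print(family)
print(len(family))  # one variable position gives 4 family members
```

Each additional variable position multiplies the family size by four, which is why equimolar precursor addition at each varying step yields all members in a single reaction vessel.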

Similarly, larger nucleic acid molecules can be constructed in synthetic
oligonucleotide pieces, and assembled by complementary hybridization and
ligation, as is well known.

D. Adenovirus Expression Vector Systems 

The adenovirus vector genome that is encapsulated in the virus particle
and that expresses exogenous genes in a gene therapy setting is a key
component of the system. Thus, the components of a recombinant adenovirus
vector genome include the ability to express selected adenovirus structural
genes, to express a desired exogenous protein, and to contain
sufficient replication and packaging signals that the genome is
packaged into a gene delivery vector particle. The preferred
replication signal is an adenovirus inverted terminal repeat
containing an adenovirus origin of replication, as is well known
and described herein.

Although adenoviruses include many proteins, not all adenovirus proteins
are required for assembly of a recombinant adenovirus particle (vector). Thus,
deletion of the appropriate genes from a recombinant Ad vector permits
accommodation of even larger "foreign" DNA segments.

A preferred recombinant adenovirus vector genome is "helper
independent" so that the genome can replicate and be packaged without the help of
a second, complementing helper virus. Complementation is provided by a
packaging cell.

In a preferred embodiment, the adenovirus vector genome does not
encode a functional adenovirus fiber protein. A non-functional fiber gene refers
to a deletion, mutation or other modification to the adenovirus fiber gene such
that the gene expresses no adenovirus fiber protein, or too little, to
package a fiber-containing adenovirus particle without complementation of the
fiber gene by a complementing plasmid or packaging cell line. Such a genome is
referred to as a "fiberless" genome, not to be confused with a fiberless particle.
Alternatively, a fiber protein may be encoded but is insufficiently expressed to
result in a fiber-containing particle.

Thus, contemplated for use are helper-independent fiberless recombinant 
adenovirus vector genomes that include genes that (a) express all adenovirus 
structural gene products but express insufficient adenovirus fiber protein to 
package a fiber-containing adenovirus particle without complementation of said 
fiber gene, (b) express an exogenous protein, and (c) contain an adenovirus
packaging signal and inverted terminal repeats containing an adenovirus origin of
replication.

The adenovirus vector genome is propagated in the laboratory in the form 
of rDNA plasmids containing the genome, and upon introduction into an 
appropriate host, the viral genetic elements provide for viral genome replication 
and packaging rather than plasmid-based propagation. Exemplary methods for 
preparing an Ad-vector genome are described in the Examples. 

A vector herein includes a nucleic acid (preferably DNA) molecule capable
of autonomous replication in a cell and to which a DNA segment, e.g., a gene or
polynucleotide, can be operatively linked to bring about replication of the
attached segment. For purposes herein, one of the nucleotide segments to be
operatively linked to vector sequences encodes at least a portion of a therapeutic
nucleic acid molecule. As noted above, therapeutic nucleic acid molecules
include those encoding proteins and also those that encode regulatory factors
that can lead to expression or inhibition or alteration of expression of a gene
product in a targeted cell.

1. Nucleic Acid Gene Expression Cassettes 

In various embodiments, a peptide-coding sequence of the therapeutic
gene is inserted into an expression vector and expressed; however, it is also
feasible to construct an expression vector that includes some non-coding
sequences as well. Preferably, however, non-coding sequences are excluded.
Alternatively, a nucleotide sequence for a soluble form of a polypeptide may be
utilized. Another preferred therapeutic viral vector includes a nucleotide
sequence encoding at least a portion of a therapeutic nucleotide sequence
operatively linked to the expression vector for expression of the coding sequence
in the therapeutic nucleotide sequence.

The choice of viral vector into which a therapeutic nucleic acid molecule
is operatively linked depends directly, as is well known in the art, on the
functional properties desired, e.g., vector replication and protein expression, and
the host cell to be transformed, these being limitations inherent in the art of
constructing recombinant DNA molecules. Although certain adenovirus
serotypes are recited herein in the form of specific examples, it should be
understood that the use of any adenovirus serotype, including hybrids and
derivatives thereof, is contemplated.

A translatable nucleotide sequence is a linear series of nucleotides that
provide an uninterrupted series of at least 8 codons that encode a polypeptide in
one reading frame. Preferably, the nucleotide sequence is a DNA sequence. The
vector itself may be of any suitable type, such as a viral vector (RNA or DNA),
naked straight-chain or circular DNA, or a vesicle or envelope containing the
nucleic acid material and any polypeptides that are to be inserted into the cell.
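
The definition of a translatable nucleotide sequence given above can be checked computationally for a candidate DNA sequence. The following Python sketch is illustrative only and rests on a stated assumption: "uninterrupted" is read here as the absence of an in-frame stop codon (TAA, TAG or TGA under the standard genetic code). The function names are hypothetical and the test sequences are arbitrary.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code, DNA alphabet


def longest_uninterrupted_codons(dna: str, frame: int = 0) -> int:
    """Length, in codons, of the longest stop-free run in one reading frame."""
    dna = dna.upper()
    best = run = 0
    for i in range(frame, len(dna) - 2, 3):
        if dna[i:i + 3] in STOP_CODONS:
            run = 0  # a stop codon interrupts the series
        else:
            run += 1
            best = max(best, run)
    return best


def is_translatable(dna: str, min_codons: int = 8) -> bool:
    """True if any of the three forward frames has at least `min_codons` codons."""
    return any(longest_uninterrupted_codons(dna, f) >= min_codons for f in range(3))


print(is_translatable("ATG" + "GCT" * 7))  # 8 codons, no stop codon
print(is_translatable("ATGGCTTAAGCT"))     # too short in every frame
```

Only the forward strand is examined in this sketch; a fuller treatment would also consider the reverse complement.
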
2. Promoters

As noted elsewhere herein, an expression nucleic acid in an Ad-derived
vector may also include a promoter, particularly a tissue or cell specific
promoter, preferably one expressed in ocular cells, particularly photoreceptors.

Promoters contemplated for use herein include regulatable (inducible) as
well as constitutive promoters, which may be used either on separate vectors or
on the same vector. Some useful regulatable promoters are those of the
CREB-regulated gene family and include inhibin, gonadotropin, cytochrome c,
glucagon, and the like (see, e.g., International PCT application No. WO
96/14061). Preferably the promoter selected is from a photoreceptor-specific
gene, such as a rhodopsin gene or gene that encodes a protein that regulates
rhodopsin expression.

E. Formulation and administration

Compositions containing therapeutically effective concentrations of
recombinant adenovirus delivery vectors are provided. These are for delivery of
therapeutic gene products to cells, particularly cells that express a particular
50 kDa receptor or other receptor with which the vectors interact. These cells
include cells of the eye and genital tract. Of particular interest are photoreceptor
cells of the eye. Administration is effected by any means through which contacting
with the photoreceptors is effected. Preferable modes of administration include, but
are not limited to, subretinal injection and, particularly, intravitreal injection, to
provide access to photoreceptor cells.

The recombinant viral compositions may also be formulated for
implantation into the anterior or posterior chamber of the eye, preferably the
vitreous cavity, in sustained release formulations, such as those adsorbed to
biodegradable supports, including collagen sponges, or in liposomes. Sustained
release formulations may be formulated for multiple dosage administration, so
that during a selected period of time, such as a month or up to about a year,
several dosages are administered. Thus, for example, liposomes may be
prepared such that a total of about two to up to about five or more times the
single dosage is administered in one injection.

The vectors are formulated in an ophthalmologically acceptable carrier for
intraocular, preferably intravitreal, administration in a volume of between about
0.05 ml and 0.150 ml, preferably between about 0.05 ml and 0.100 ml.

The composition can be provided in a sealed sterile vial containing an
amount of a compound of formula I that, upon intraocular administration, will
deliver a sufficient amount of viral particles to the photoreceptors in a volume of
about 50 to 150 µl, containing at least about 10^7, more preferably at least about
10^8, plaque forming units in such volume. Typically, the vials will, thus, contain
about 0.150 ml of the composition.
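
The dose and volume figures above imply a minimum stock titer. The following Python sketch shows the arithmetic only, assuming a dose of 10^7 plaque forming units delivered in an injection volume within the stated 0.05 ml to 0.15 ml range; the function name `required_titer_pfu_per_ml` is an illustrative assumption and the figures are not a dosing recommendation.

```python
def required_titer_pfu_per_ml(dose_pfu: float, volume_ml: float) -> float:
    """Titer (pfu/ml) needed to deliver `dose_pfu` in `volume_ml`."""
    if volume_ml <= 0:
        raise ValueError("volume_ml must be positive")
    return dose_pfu / volume_ml


# Injection volumes spanning the range given in the text (50 to 150 microliters).
for volume_ml in (0.05, 0.10, 0.15):
    titer = required_titer_pfu_per_ml(1e7, volume_ml)
    print(f"{volume_ml:.2f} ml -> {titer:.2e} pfu/ml")
```

Smaller injection volumes require proportionally more concentrated preparations to deliver the same number of plaque forming units.
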

To prepare compositions, the viral particles are dialyzed into a suitable
ophthalmologically acceptable carrier or, for example, may be
concentrated and/or mixed therewith. The resulting mixture may be a solution,
suspension or emulsion. In addition, the viral particles may be formulated as
the sole pharmaceutically active ingredient in the composition or may be
combined with other active agents for the particular disorder treated.

For administration by intraocular injection or via eyedrops, suitable
carriers include, but are not limited to, physiological saline, phosphate buffered
saline (PBS), balanced salt solution (BSS), lactated Ringer's solution, and solutions
containing thickening and solubilizing agents, such as glucose, polyethylene
glycol, and polypropylene glycol and mixtures thereof. Liposomal suspensions
may also be suitable as pharmaceutically acceptable carriers. These may be
prepared according to methods known to those skilled in the art. Suitable
ophthalmologically acceptable carriers are known. Solutions or mixtures
intended for ophthalmic use may be formulated as 0.01% - 10% isotonic
solutions, pH about 5-7, with appropriate salts [see, e.g., U.S. Patent No.
5,116,868, which describes typical compositions of ophthalmic irrigation
solutions and solutions for local application]. Such solutions, which have a pH
adjusted to about 7.4, contain, for example, 90-100 mM sodium chloride, 4-6
mM dibasic potassium phosphate, 4-6 mM dibasic sodium phosphate, 8-12 mM
sodium citrate, 0.5-1.5 mM magnesium chloride, 1.5-2.5 mM calcium chloride,
15-25 mM sodium acetate, 10-20 mM D,L-sodium β-hydroxybutyrate and 5-5.5
mM glucose.

The compositions may be prepared with carriers that protect them from
rapid elimination from the body, such as time release formulations or coatings.
Such carriers include controlled release formulations, such as, but not limited to,
microencapsulated delivery systems, and biodegradable, biocompatible polymers,
such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid,
polyorthoesters, polylactic acid and other types of implants that may be placed
directly into the anterior or posterior chamber or vitreous cavity of the eye. The
compositions may also be administered in pellets, such as Elvax pellets
(ethylene-vinyl acetate copolymer resin).

Liposomal suspensions, including tissue-targeted liposomes, may also be
suitable as pharmaceutically acceptable carriers. For example, liposome
formulations may be prepared by methods known to those of skill in the art [see,
e.g., Kimm et al. (1983) Bioch. Bioph. Acta 728:339-398; Assil et al. (1987)
Arch Ophthalmol. 105:400; and U.S. Patent No. 4,522,811]. The viral particles
may be encapsulated into the aqueous phase of liposome systems.

The active materials can also be mixed with other active materials that
do not impair the desired action, or with materials that supplement the desired
action or have other action, including viscoelastic materials, such as hyaluronic
acid, which is sold under the trademark HEALON, which is a solution of a high
molecular weight (MW of about 3 million) fraction of sodium hyaluronate
[manufactured by Pharmacia, Inc.; see, e.g., U.S. Patent Nos. 5,292,362,
5,282,851, 5,273,056, 5,229,127, 4,517,295 and 4,328,803], VISCOAT
[fluorine-containing (meth)acrylates, such as
1H,1H,2H,2H-heptadecafluorodecylmethacrylate; see, e.g., U.S. Patent Nos.
5,278,126, 5,273,751 and 5,214,080; commercially available from Alcon
Surgical, Inc.], ORCOLON
[see, e.g., U.S. Patent No. 5,273,056; commercially available from Optical
Radiation Corporation], methylcellulose, methyl hyaluronate, polyacrylamide and
polymethacrylamide [see, e.g., U.S. Patent No. 5,273,751]. The viscoelastic
materials are present generally in amounts ranging from about 0.5 to 5.0%,
preferably 1 to 3%, by weight of the conjugate material and serve to coat and
protect the treated tissues. The compositions may also include a dye, such as
methylene blue or other inert dye, so that the composition can be seen when
injected into the eye. Additional active agents may be included.

The compositions can be enclosed in ampules, disposable syringes or 
multiple or single dose vials made of glass, plastic or other suitable material. 
Such enclosed compositions can be provided in kits. In particular, kits 
containing vials, ampules or other containers, preferably disposable vials with
a sufficient amount of the composition to deliver about 0.100 ml thereof, and
disposable needles, preferably self-sealing 25-30 gauge needles, are provided
herein. 

Finally, the compounds may be packaged as articles of manufacture 
containing packaging material, typically a vial, an ophthalmologically acceptable 
composition containing the viral particles and a label that indicates the 
therapeutic use of the composition. 

Also provided are kits for practice of the methods herein. The kits 
contain one or more containers, such as sealed vials, with sufficient composition 
for single dosage administration, and one or more needles, such as self sealing 
25-33 gauge needles, preferably 33 gauge or smaller needles, precisely 
calibrated syringes or other precisely calibrated delivery device, suitable for 
intravitreal injection. 

Administration of the composition is preferably by intraocular injection,
although other modes of administration may be effective, if a sufficient
amount of the compound achieves contact with the vitreous cavity. Intraocular
injection may be effected by intravitreal injection, aqueous humor injection or 
injection into the external layers of the eye, such as subconjunctival injection or 
subtenon injection, or by topical application to the cornea, if a penetrating 
formulation is used. 

Administration 

The compositions containing the compounds are administered
intraocularly or by other means, such as topically in the form of penetrating
eyedrops, whereby contact of the recombinant vectors with the aqueous humor
is effected. Intraocular administration may be effected by intravitreal injection,
aqueous humor injection, or injection into the external layers of the eye, such as
subconjunctival injection or subtenon injection, preferably in free form, but,
alternatively, in liposomes or other sustained drug delivery device.
Administration is preferably by intravitreal injection, preferably through
self-sealing 25-30 gauge needles or other suitably calibrated delivery device.
Injection into the eye may be through the pars plana via the self-sealing needle.

It is further understood that, for any particular subject, specific dosage
regimens should be adjusted over time according to the individual need and the
professional judgment of the person administering or supervising the
administration of the recombinant viruses, and that the concentration ranges set
forth herein are exemplary only and are not intended to limit the scope or
practice of the claimed methods.

F. Diseases, Disorders and therapeutic products

1. Disease and disorders

Retinitis pigmentosa 

Methods for specifically or selectively targeting recombinant adenovirus
vectors for delivery of gene products, particularly therapeutic products, are
provided herein. These methods are particularly suitable for targeting cells that
express receptors that are selectively recognized by Ad viruses of subgroup D,
particularly Ad37. It is shown herein that these viruses selectively
recognize receptors on cells, such as conjunctival cells and photoreceptors, that
are not recognized by other adenoviruses. Hence, methods are provided for
targeting these cell types by providing vectors that are packaged in viral
particles that contain a sufficient portion of a fiber protein from one of these Ad
serotypes to bind to these receptors. These methods are useful for targeting
photoreceptors and for treating ocular disorders, including, but not limited to,
inherited and acquired retinal, neovascular degenerative diseases (see table
below).

It is estimated that 1 in 3,500 individuals in the United States suffers from
one of the pigmented retinopathies. This group of retinal diseases, commonly
called retinitis pigmentosa (RP), is characterized by progressive loss of
peripheral and night vision. Patients may be affected at almost any age, and it
is not uncommon to experience symptoms in early childhood in certain inherited
forms. It has been shown that a variety of mutations in genes expressed in
the photoreceptors, including the rhodopsin gene and genes in its pathway,
appear to be responsible for these diseases. In addition to mutations in
rhodopsin, the retinal pigmented epithelial (RPE) cells also undergo
degenerative changes and can form clumps of pigment that give rise to the
characteristic pigmentary changes seen in patients with RP.

Angiogenesis and ocular diseases and disorders 
The vast majority of diseases that cause catastrophic loss of vision do so
as a result of ocular neovascularization. Age-related macular degeneration
(ARMD) affects 12-15 million Americans over the age of 65 and causes visual
loss in 10-15% of them as a direct effect of choroidal (sub-retinal)
neovascularization. The leading cause of visual loss for Americans under the
age of 65 is diabetes; 16 million individuals in the United States are diabetic and
40,000 per year suffer from ocular complications of the disease, which often are
a result of retinal neovascularization. Laser photocoagulation has been effective
in preventing severe visual loss in subgroups of high-risk diabetic patients, but
the overall 10-year incidence of retinopathy remains essentially unchanged. For
patients with choroidal neovascularization due to ARMD or inflammatory eye
disease, such as ocular histoplasmosis, photocoagulation, with few exceptions,
is ineffective in preventing visual loss. While recently developed,
non-destructive photodynamic therapies hold promise for temporarily reducing
visual loss in patients with previously untreatable choroidal
neovascularization, only 61.4% of patients treated every 3-4 months had
improved or stabilized vision compared to 45.9% of the placebo-treated group.

In the normal adult, angiogenesis is tightly regulated and limited to wound
healing, pregnancy and uterine cycling. Angiogenesis is turned on by specific
angiogenic molecules such as basic and acidic fibroblast growth factor (FGF),
vascular endothelial growth factor (VEGF), angiogenin, transforming growth
factor (TGF), tumor necrosis factor-α (TNF-α) and platelet derived growth factor
(PDGF). Angiogenesis can be suppressed by inhibitory molecules such as
interferon-α, thrombospondin-1, angiostatin and endostatin. It is the balance of
these naturally occurring stimulators and inhibitors that controls the normally
quiescent capillary vasculature. When this balance is upset, as in certain disease
states, capillary endothelial cells are induced to proliferate, migrate and
ultimately differentiate.

Angiogenesis plays a central role in a variety of diseases, including, but
not limited to, cancer and ocular neovascularization. Sustained growth and
metastasis of a variety of tumors has also been shown to be dependent on the
growth of new host blood vessels into the tumor in response to tumor-derived
angiogenic factors. Proliferation of new blood vessels in response to a variety of
stimuli occurs as the dominant finding in the majority of eye diseases that blind,
such as, but not limited to, proliferative diabetic retinopathy (PDR), ARMD,
rubeotic glaucoma, interstitial keratitis and retinopathy of prematurity. In these
diseases, tissue damage can stimulate release of angiogenic factors resulting in
capillary proliferation. VEGF plays a dominant role in iris neovascularization and
neovascular retinopathies. While reports clearly show a correlation between
intraocular VEGF levels and ischemic retinopathic ocular neovascularization, FGF
likely also plays a role. Basic and acidic FGF are known to be present in the normal
adult retina, even though detectable levels are not consistently correlated with
neovascularization. This may be largely due to the fact that FGF binds very
tightly to charged components of the extracellular matrix and may not be readily
available in a freely diffusible form that would be detected by standard assays of
intraocular fluids.

A final common pathway in the angiogenic response involves
integrin-mediated information exchange between a proliferating vascular
endothelial cell and the extracellular matrix. This class of adhesion receptors,
called integrins, are expressed as heterodimers having an α and a β subunit on
all cells. One such integrin, αvβ3, is the most promiscuous member of this
family and allows endothelial cells to interact with a wide variety of
extracellular matrix components. Peptide and antibody antagonists of this
integrin inhibit angiogenesis by selectively inducing apoptosis of the
proliferating vascular endothelial cells. Two cytokine-dependent pathways of
angiogenesis exist and may be defined by their dependency on distinct vascular
cell integrins, αvβ3 and αvβ5. Specifically, basic FGF- and VEGF-induced
angiogenesis depend on integrin αvβ3 and αvβ5, respectively, since antibody
antagonists of each integrin selectively block one of these angiogenic pathways
in the rabbit corneal and chick chorioallantoic membrane (CAM) models. Peptide
antagonists that block all αv integrins inhibit FGF- and VEGF-stimulated
angiogenesis. While normal human ocular blood vessels do not display either
integrin, αvβ3 and αvβ5 integrins are selectively displayed on blood vessels in
tissues from patients with active neovascular eye disease. While only αvβ3 was
consistently observed in tissue from patients with ARMD, αvβ3 and αvβ5 were
present in tissues from patients with PDR. Systemically administered peptide
antagonists of integrins blocked new blood vessel formation in a mouse model
of retinal vasculogenesis.

In addition to adhesion events described above, cell migration through the
extracellular matrix also depends on proteolysis. Matrix metalloproteinases
(MMPs) are a family of zinc-requiring matrix-degrading enzymes that include the
collagenases, gelatinases and stromelysins, all of which have been implicated in
invasive cell behavior. Invasive cell processes such as tumor metastasis and
angiogenesis have been found to be associated with the expression of integrins
and MMP-2. Integrins and MMP-2 are found throughout the eye, where they may
interact to maintain a quiescent vasculature until the balance is upset,
resulting in pathological angiogenesis. A non-catalytic C-terminal
hemopexin-like domain of MMP-2 (PEX) can block cell surface collagenolytic
activity and inhibit angiogenesis in the CAM model by preventing localization of
MMP-2 to the surface of invasive cells through interaction with the integrin
αvβ3.

Hence, anti-angiogenic agents have a role in treating retinal degeneration
to prevent the damaging effects of these trophic and growth factors.
Angiogenic agents also have a role in promoting desirable vascularization to
retard retinal degeneration by enhancing blood flow to cells.

Members of adenovirus subgroup D, Ad8, 19A, and 37, are infectious
agents that cause particularly severe cases of epidemic keratoconjunctivitis
(EKC) (Arnberg et al. (1998) Virology 227:239-244; Curtis et al. (1998)
J. Med. Microbiol. 47:91-94; Ritterband et al. (1998) Rev. Med. Virol. 5:187-201;
and Takeuchi et al. (1999) J. Clin. Microbiol. 37:3392-3394). There is no
effective treatment for this debilitating and contagious disease, and EKC
continues to be a problem in ophthalmology clinics worldwide (Curtis et al.
(1998) J. Med. Microbiol. 47:91-94; Lukashok et al. (1998)
Curr. Clin. Top. Infect. Dis. 75:286-304). Hence the vectors herein may be used
for treating the disease.

Table 3

CANDIDATE TARGETS FOR OCULAR DISEASE THERAPY

Disease                  Candidate target(s)
Retinitis pigmentosa     Rhodopsin gene, and genes that regulate
                         expression thereof; rds/peripherin
Stargardt's disease      rim protein (ABCR protein)
Choroideremia            rab geranylgeranyl transferase; CHM, TCD, CHML*
Gyrate atrophy           ornithine aminotransferase
Macular dystrophy        rds/peripherin

* See "MSR6-yeast homologue of the choroideraemia gene," Nature Genetics
3:193-4 (1993).

TABLE 4

Other Diseases

Exudative Choroidal Diseases
    ICSC, fluorescein angiogram
    ICSC with large serous detachment of RPE (retinal pigmented epithelium)
    ICSC with bullous retinal detachment
    Macular drusen, exudative, confluent
    Drusen, sub-RPE choroidal neovascularization
    Drusen, notched serous detachment of RPE
    Drusen, notched serous and hemorrhagic detachment of RPE
    Drusen, serous and hemorrhagic detachment of RPE and retina
    Drusen, organized RPE detachment causing bullous retinal detachment
    Drusen, geographic atrophy of RPE
    Drusen, exudative and cuticular, vitelliform macular detachment
    Drusen, cuticular, large vitelliform macular detachment
    North Carolina dystrophy with macular staphyloma
    North Carolina dystrophy with macular staphyloma
    Angioid streaks, pseudoxanthoma elasticum (PXE), CNVM
    Angioid streaks, PXE, large notched retinal detachment
    Myopic degeneration, Foerster-Fuchs spot
    Presumed ocular histoplasmosis syndrome (POHS)
    Submacular bacterial abscess
    Toxocara canis, subretinal granuloma
    Serpiginous (geographic) choroiditis
    Posterior scleritis
    Harada's disease
    Posterior sympathetic uveitis
    Benign reactive lymphoid hyperplasia of uveal tract
    Choroidal ruptures and CNVM
    Cavernous hemangioma of choroid
    Choroidal osteoma
    Choroidal nevus, serous macular detachment
    Choroidal nevus with CNVM
    Diffuse sclerochoroidal melanocytic nevus
    Choroidal melanoma with serous detachment of RPE
    Metastatic lung carcinoma to choroid
    Sub-RPE reticulum cell sarcoma
    RPE tear, idiopathic choroidal neovascularization

Heredodystrophic Disorders Affecting RPE & Retina
    Best's vitelliform macular dystrophy
    Best's vitelliform macular dystrophy with CNVM
    Best's vitelliform macular dystrophy, multiple lesions
    Adult-onset vitelliform foveomacular dystrophy
    Pattern dystrophy simulating fundus flavimaculatus
    Stargardt's disease (fundus flavimaculatus)
    Asteroid macular dystrophy
    Sjogren-Larsson syndrome
    Oguchi's disease, light-adapted state
    Oguchi's disease, dark-adapted state
    Fundus albipunctatus
    Retinitis pigmentosa, cystoid macular edema
    Crystalline tapetoretinal dystrophy
    Choroideremia
    Goldmann-Favre syndrome
    Sex-linked juvenile retinoschisis
    Perivenous retinitis pigmentosa

Retinal Vascular Disorders
    Retinal arteriovenous aneurysm
    Central retinal artery occlusion
    Cilioretinal artery obstruction
    Ischemic retinopathy in systemic lupus erythematosus
    Ischemic retinopathy in scleroderma
    Hemorrhagic detachment of internal limiting membrane, hypertensive
        retinopathy
    Acquired retinal arterial macroaneurysm
    Cystoid macular edema, aphakic
    Cystoid macular edema, nicotinic acid maculopathy
    Congenital retinal telangiectasis
    Acquired bilateral juxtafoveal telangiectasis
    Acquired bilateral juxtafoveal obliterative telangiectasis
    Diabetic optic neuropathy
    X-ray radiation exudative retinopathy
    Sickle cell SC disease, macular hemorrhage
    Retinal arterial aneurysms, arteritis, neuroretinitis
    Branch retinal vein obstruction (BRVO)
    BRVO, exudative maculopathy
    BRVO, optic disc new vessels, photocoagulation
    Waldenstrom's macroglobulinemia

Inflammatory Diseases of the Retina and Choroid
    Luetic retinal vasculitis
    Focal Candida retinal abscess
    Toxoplasmosis, atrophic chorioretinal scar
    Toxoplasmosis retinitis and macular detachment
    Toxoplasmosis scar, CNVM, macular detachment
    Diffuse unilateral subacute neuroretinitis, small worm
    Diffuse unilateral subacute neuroretinitis, large worm
    Cytomegalic inclusion disease, papillitis
    Acute posterior multifocal placoid pigment epitheliopathy
    Acute macular neuroretinitis
    Sarcoid retinitis
    Sarcoid papillitis
    Behcet's disease
    Vitiliginous (bird-shot) chorioretinitis
    Multifocal choroiditis and panuveitis (pseudo-POHS)

Retinal and Pigment Epithelial Hamartomas
    Congenital grouped albinotic RPE spots
    Congenital hyperplasia of RPE
    Combined RPE and retinal hamartoma, juxtapapillary
    Combined RPE and retinal hamartoma, peripheral
    Cystic astrocytoma, juxtapapillary
    Astrocytoma, macula
    Astrocytoma, juxtapapillary
    Cavernous hemangioma of retina
    Juxtapapillary sessile retinal capillary hemangioma
    Juxtapapillary endophytic retinal capillary hemangioma

Other Tumors of the Choroid
    Choroidal metastasis
    Choroidal osteoma
    Choroidal hemangioma
    Miscellaneous uveal tumors

Intraocular Lymphoid Tumors
    The leukemias and lymphomas

Tumors of the Vitreous
    Non-Hodgkin's ("reticulum cell") lymphoma
    Tumor involvement of the vitreous cavity

Macular Disease
    Age-related macular degeneration - atrophic form
    Exudative age-related macular degeneration
    Choroidal neovascular membrane in degenerative myopia
    Central serous retinopathy
    Macular hole
    Macular dystrophies

Retinal Vascular Disease
    Etiologic mechanisms in diabetic retinopathy
    Background diabetic retinopathy
    Proliferative diabetic retinopathy
    Retinal arterial obstructive disease
    Central retinal vein occlusion
    Retinal branch vein occlusion
    Pregnancy and retinal disease
    Pregnancy-induced hypertension
    Hypertension
    The rheumatic disease
    Parafoveal telangiectasis
    Coats disease
    Disseminated intravascular systemic coagulopathy and related vasculopathies
    Hemoglobinopathies
    Retinopathy of prematurity
    Acquired retinal macroaneurysms
    Eales disease
    Radiation retinopathy
    The ocular ischemic syndrome

Inflammatory Disease
    Ocular toxoplasmosis
    Ocular toxocariasis
    Ocular cysticercosis
    Cytomegalovirus infections of the retina
    Retinal and ophthalmologic manifestations of AIDS
    Acute retinal necrosis syndrome
    Endogenous fungal infections of the retina and choroid
    Pars planitis
    Syphilis and tuberculosis
    Diffuse unilateral subacute neuroretinitis
    Scleritis
    Birdshot retinochoroidopathy
    Punctate inner choroidopathy
    Sarcoidosis
    Acute multifocal placoid pigment epitheliopathy
    Geographic helicoid peripapillary choroidopathy (GHPC): serpiginous
        choroiditis
    Sympathetic ophthalmia
    Vogt-Koyanagi-Harada syndrome (uveomeningitic syndrome)
    Ciliochoroidal (uveal) effusion

Reproduced from: Stereoscopic Atlas of Ocular Diseases Diagnosis and
Treatment, 2nd Edition, J. Donald O. Gass, Vol. 1 & 2, C.V. Mosby Co. (1987);
and Retina Vol. II, Editor, Stephen J. Ryan, Medical Retina, C.V. Mosby Co.
(1989).

2. Therapeutic products

Therapeutic products include, but are not limited to, wild-type genes that
are defective in ocular disorders, such as rhodopsin, or fragments thereof
sufficient to correct the genetic defect, trophic factors, including growth factors,
inhibitors and agonists of trophic factors, anti-apoptosis factors and other
products described herein or known to those of skill in the art to be useful for
treatment of disorders of the eye or that can be treated by a product expressed
by a photoreceptor.



OCULAR GENE THERAPY STRATEGIES

GENERAL DISEASE           EXAMPLES                      STRATEGY
Hereditary retinal and    Retinitis pigmentosa;         Growth factors (e.g., GDNF);
macular degeneration      Stargardt's disease;          anti-apoptotic factors (e.g.,
                          other macular dystrophies     bcl2 gene); Stargardt Disease
                                                        Gene (ABCR)†
Neovascular               Diabetes; choroidal           Anti-angiogenesis factors
                          neovascularization
Anti-tumor                Retinoblastoma                Antiproliferant
Glaucoma                  Nerve fiber layer atrophy     Neuroprotective agent

† See Allikmets et al. (1997) Science 277:1805-1807.



For example, for treatment of retinitis pigmentosa, the adenovirus vector
can deliver a wild-type rhodopsin gene or a growth factor or trophic factor, such
as ciliary neurotrophic factor (CNTF); for treatment of Stargardt's disease, the
vector can deliver a wild-type ABCR (also called STGD1) or a growth factor or
anti-angiogenic agent; for diabetic retinopathies and retinal vascularization, the
vector can deliver growth factors, such as a TGF (TGFβ), to prevent
degeneration.

The following examples are included for illustrative purposes only and are
not intended to limit the scope of the invention.

EXAMPLE 1 

Preparation of Adenovirus Packaging Cell Lines 

Cell lines that are commonly used for growing adenovirus are useful as
host cells for the preparation of adenovirus packaging cell lines. Preferred cells
include 293 cells, an adenovirus-transformed human embryonic kidney cell line
obtained from the ATCC, having Accession Number CRL 1573; HeLa, a human
epithelial carcinoma cell line (ATCC Accession Number CCL-2); A549, a human
lung carcinoma cell line (ATCC Accession Number CCL 185); and other
epithelial-derived cell lines. As a result of the adenovirus transformation, the
293 cells contain the E1 early region regulatory gene. All cells were maintained
in complete DMEM + 10% fetal calf serum unless otherwise noted.

These cell lines allow the production and propagation of adenovirus-based
gene delivery vectors that have deletions in preselected gene regions and that
are obtained by cellular complementation of adenoviral genes. To provide the
desired complementation of such deleted adenoviral genomes in order to
generate a viral vector, plasmid vectors that contain preselected functional units
have been designed. Such units include, but are not limited to, the E1 early
region, E4 and the viral fiber gene. The preparation of plasmids providing such
complementation, thereby being "complementary plasmids or constructs," that
are stably inserted into host cell chromosomes is described below.

A. Preparation of an E4-Expressing Plasmid for Complementation of
E4-Gene-Deleted Adenoviruses

The viral E4 regulatory region contains a single transcription unit that is
alternatively spliced to produce several different mRNA products. The
E4-expressing plasmid prepared as described herein and used to transfect the
293 cell line contains the entire E4 transcription unit. A DNA fragment
extending from 175 nucleotides upstream of the E4 transcription start site,
including the natural E4 promoter, to 153 nucleotides downstream of the E4
polyadenylation signal, including the natural E4 terminator signal, corresponding
to nucleotides 32667-35780 of the adenovirus type 5 (hereinafter referred to as
Ad5) genome as described in Chroboczek et al. (Virol., 186:280-285 (1992),
GenBank Accession Number M73260), was amplified from Ad5 genomic DNA,
obtained from the ATCC, via the polymerase chain reaction (PCR). Sequences of
the primers used were 5'CGGTACACAGAATTCAGGAGACACAACTCC3'
(forward or 5' primer referred to as E4L) (SEQ ID NO: 1) and
5'GCCTGGATCCGGGAAGTTACGTAACGTGGGAAAAC3' (SEQ ID NO: 2)
(backward or 3' primer referred to as E4R). To facilitate cloning of the PCR
fragment, these oligonucleotides were designed to create new sites for the
restriction enzymes EcoRI and BamHI, respectively, as indicated with underlined
nucleotides. DNA was amplified via PCR using 30 cycles of 92°C for 1 minute,
50°C for 1 minute, and 72°C for 3 minutes, resulting in amplified full-length E4
gene products.
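
As an illustrative check only, and not part of the described procedure, the following Python sketch locates the EcoRI and BamHI recognition sequences in the E4L and E4R primers and computes the length of the Ad5 region from the stated genome coordinates. The helper names (site_offset, span_length) are hypothetical, and the 5'-most base of E4L is left out of the string because it lies outside the site being searched.

```python
# Recognition sequences of the two enzymes named in the text.
SITES = {"EcoRI": "GAATTC", "BamHI": "GGATCC"}

# Primers transcribed from the text with spacing removed.
# E4L_CORE omits the 5'-most base of E4L; the EcoRI site search does not need it.
E4L_CORE = "GGTACACAGAATTCAGGAGACACAACTCC"
E4R = "GCCTGGATCCGGGAAGTTACGTAACGTGGGAAAAC"


def site_offset(primer: str, enzyme: str) -> int:
    """Return the 0-based offset of the enzyme's site in the primer, or -1 if absent."""
    return primer.upper().replace(" ", "").find(SITES[enzyme])


def span_length(first: int, last: int) -> int:
    """Length of an inclusive, 1-based coordinate range."""
    return last - first + 1


ecori_at = site_offset(E4L_CORE, "EcoRI")   # offset within the E4L core
bamhi_at = site_offset(E4R, "BamHI")        # offset within E4R
e4_region_nt = span_length(32667, 35780)    # Ad5 nucleotides quoted above

print(ecori_at, bamhi_at, e4_region_nt)
```

The span is computed from the quoted genome coordinates alone; any primer-added bases lying outside those coordinates are not counted.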

The amplified DNA E4 products were then digested with EcoRI and
BamHI for cloning into the compatible sites of pBluescript/SK+ by standard
techniques to create the plasmid pBS/E4. A 2603 base pair (bp) cassette
including the herpes simplex virus thymidine kinase promoter, the hygromycin
resistance gene, and the thymidine kinase polyadenylation signal was excised
from the plasmid pMEP4 (Invitrogen, San Diego, CA) by digestion with FspI
followed by addition of BamHI linkers (5'CGCGGATCCGCG3') (SEQ ID NO: 3)
for subsequent digestion with BamHI to isolate the hygromycin-containing
fragment.

The isolated BamHI-modified fragment was then cloned into the BamHI
site of pBS/E4 containing the E4 region to create the plasmid pE4/Hygro
containing 8710 bp. The pE4/Hygro plasmid has been deposited with the ATCC
under accession number 97739. The complete nucleotide sequence of
pE4/Hygro is set forth in SEQ ID NO: 4. Position number 1 of the linearized
vector corresponds to approximately the middle portion of the pBS/SK+
backbone. The 5' and 3' ends of the E4 gene are located at respective
nucleotide positions 3820 and 707 of SEQ ID NO: 4 while the 5' and 3' ends of
the hygromycin insert are located at respective nucleotide positions 3830 and
6470. In the clone that was selected for use, the E4 and hygromycin resistance
genes were divergently transcribed.
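
The coordinates given for pE4/Hygro can be cross-checked against the Ad5 genome coordinates quoted in section A. The Python sketch below is a simple arithmetic check under the assumption that all positions are inclusive and 1-based; the variable and function names are illustrative and do not come from the sequence listing.

```python
def span_length(first: int, last: int) -> int:
    """Length of an inclusive, 1-based coordinate range."""
    return last - first + 1


PLASMID_BP = 8710                 # stated size of pE4/Hygro
E4_IN_GENOME = (32667, 35780)     # Ad5 coordinates of the amplified E4 region
E4_IN_PLASMID = (707, 3820)       # 3' and 5' ends of E4 in SEQ ID NO: 4
HYGRO_IN_PLASMID = (3830, 6470)   # ends of the hygromycin insert

e4_genome_nt = span_length(*E4_IN_GENOME)
e4_plasmid_nt = span_length(*E4_IN_PLASMID)
hygro_nt = span_length(*HYGRO_IN_PLASMID)

# Whatever is left is vector backbone plus junction sequence.
remainder_nt = PLASMID_BP - e4_plasmid_nt - hygro_nt

print(e4_genome_nt, e4_plasmid_nt, hygro_nt, remainder_nt)
```

Under these assumptions the E4 insert occupies the same number of nucleotides in the plasmid as in the genome, which is consistent with the statement that the entire E4 transcription unit was cloned.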

B. Preparation of a Fiber-Expressing Plasmid for Complementation of
Fiber-Gene-Deleted Adenoviruses

To prepare a fiber-encoding construct, primers were designed to amplify
the fiber coding region from Ad5 genomic DNA with the addition of unique
BamHI and NotI sites at the 5' and 3' ends of the fragment, respectively. The
Ad5 nucleotide sequence is available with the GenBank Accession Number
M18369. The 5' and 3' primers had the respective nucleotide sequences of
5'ATGGGATCCAAGATGAAGCGCGCAAGACCG3' (SEQ ID NO: 5) and
5'CATAACGCGGCCGCTTCTTTATTCTTGGGC3' (SEQ ID NO: 6), where the
inserted BamHI and NotI sites are indicated by underlining. The 5' primer also
contained a nucleotide substitution 3 nucleotides 5' of the second ATG codon (C
to A) that is the initiation site. The nucleotide substitution was included so as to
improve the consensus for initiation of fiber protein translation.

The amplified DNA fragment was inserted into the BamHI and NotI sites
of pcDNA3 (Invitrogen) to create the plasmid designated pCDNA3/Fiber having
7148 bp. The parent plasmid contained the CMV promoter, the bovine growth
hormone (BGH) terminator and the gene for conferring neomycin resistance. The
viral sequence included in this construct corresponds to nucleotides
31040-32791 of the Ad5 genome.

The complete nucleotide sequence of pCDNA3/Fiber is listed in SEQ ID
NO: 7 where the nucleotide position 1 corresponds to approximately the middle
of the pcDNA3 vector sequence. The 5' and 3' ends of the fiber gene are
located at respective nucleotide positions 916 with ATG and 2661 with TAA.

To enhance expression of fiber protein by the constitutive CMV promoter
provided by the pcDNA vector, a BglII fragment containing the tripartite leader
(TPL) of adenovirus type 5 was excised from pRD112a (Sheay et al.,
BioTechniques, 15:856-862 (1993)) and inserted into the BamHI site of
pCDNA3/Fiber to create the plasmid pCLF having 7469 bp. The adenovirus
tripartite leader sequence, present at the 5' end of all major late adenoviral
mRNAs as described by Logan et al., Proc. Natl. Acad. Sci. USA, 81:3655-3659
(1984) and Berkner, BioTechniques, 6:616-629 (1988), also referred to as
a "partial TPL" since it contains a partial exon 1, shows correspondence with the
Ad5 leader sequence having three spatially separated exons corresponding to
nucleotide positions 6081-6089 (the 3' end of the first leader segment),
7111-7182 (the entire second leader segment), and 9644-9845 (the third leader
segment and sequence downstream of that segment). The corresponding cDNA
sequence of the partial tripartite leader sequence present in pCLF is included in
SEQ ID NO: 8 bordered by BamHI/BglII 5' and 3' sites at respective nucleotide
positions 907-912 to 1228-1233. The nucleotide sequence of an isolated partial
TPL is also listed separately as SEQ ID NO: 22 with the noted 5' and 3'
restriction sites and with the following nucleotide regions identified: 1-6 nt BglII
site; 1-18 nt polylinker; 19-27 nt last 9 nt of the first leader segment (exon 1);
28-99 nt second leader segment (exon 2); 100-187 nt third leader segment
(exon 3); 188-301 nt contains the nt sequence immediately following the third
leader in the genome with an unknown function; and 322-327 nt BglII site.

The pCLF plasmid has been deposited with the ATCC as described in
Example 4. The complete nucleotide sequence of pCLF is listed in SEQ ID NO: 8
where the nucleotide position 1 corresponds to approximately the middle of the
pcDNA3 parent vector sequence. The 5' and 3' ends of the Ad5 fiber gene are
located at respective nucleotide positions 1237-1239 with ATG and 2980-2982
with TAA.
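
The coordinates quoted for the partial TPL and for the fiber gene in pCDNA3/Fiber and pCLF can be checked for internal consistency. The following Python sketch is offered only as a worked arithmetic example; it assumes inclusive, 1-based positions, and its names are illustrative rather than taken from the sequence listings.

```python
def span_length(first: int, last: int) -> int:
    """Length of an inclusive, 1-based coordinate range."""
    return last - first + 1


# Partial TPL segments in Ad5 genome coordinates.
genome_exon1 = span_length(6081, 6089)
genome_exon2 = span_length(7111, 7182)
genome_exon3_plus = span_length(9644, 9845)   # third leader + downstream sequence

# The same segments as annotated in SEQ ID NO: 22.
seq22_exon1 = span_length(19, 27)
seq22_exon2 = span_length(28, 99)
seq22_exon3 = span_length(100, 187)
seq22_downstream = span_length(188, 301)

# TPL cassette in pCLF, bounded by the hybrid BamHI/BglII sites.
cassette_nt = span_length(907, 1233)

# Size gained on going from pCDNA3/Fiber to pCLF, and the shift of the fiber ATG.
plasmid_gain = 7469 - 7148
atg_shift = 1237 - 916

# Fiber open reading frame, ATG through TAA, in each plasmid.
orf_pclf = span_length(1237, 2982)
orf_pcdna3_fiber = span_length(916, 2661)
codons = orf_pclf // 3

print(cassette_nt, plasmid_gain, atg_shift, orf_pclf, codons)
```

Under these assumptions the segment lengths agree between the genome and SEQ ID NO: 22, the plasmid size gain equals the shift in the fiber ATG position, and the open reading frame has the same length in both plasmids and is a whole number of codons. The cassette span exceeds the size gain by 6 nt, which is consistent with the original BamHI site being counted once.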

C. Generation of an Adenovirus Packaging Cell Line Carrying Plasmids
Encoding Functional E4 and Fiber Proteins

The 293 cell line was selected for preparing the first adenovirus
packaging line as it already contains the E1 gene as prepared by Graham et al.,
J. Gen. Virol., 35:59-74 (1977) and as further characterized by Spector, Virol.,
730:533-538 (1983). Before electroporation, 293 cells were grown in RPMI
medium + 10% fetal calf serum. Four x 10^6 cells were electroporated with 20
µg each of pE4/Hygro DNA and pCLF DNA using a BioRad GenePulser and
settings of 300 V, 25 µF. DNA for electroporation was prepared using the
Qiagen system according to the manufacturer's instructions (Bio-Rad, Richmond,
CA).

Following electroporation, cells were split into fresh complete DMEM +
10% fetal calf serum containing 200 µg/ml Hygromycin B (Sigma, St. Louis,
MO).

From expanded colonies, genomic DNA was isolated using the
"MICROTURBOGEN" system (Invitrogen) according to manufacturer's
instructions. The presence of integrated E4 DNA was assessed by PCR using
the primer pair E4R and ORF6L (5'TGCTTAAGCGGCCGCGAAGGAGAAGTCC3')
(SEQ ID NO: 9), the latter of which is a 5' forward primer near adenovirus 5
open reading frame 6.

One clone, designated 211, was selected exhibiting altered growth
properties relative to that seen in parent cell line 293. The 211 clone contained
the product, indicating the presence of inserted DNA corresponding to most, if
not all, of the E4 fragment contained in the pE4/Hygro plasmid. The 211 cell
line has been deposited with the ATCC as described in Example 4. This line was
further evaluated by amplification using the primer pair E4L/E4R described
above, and a product corresponding to the full-length E4 insert was detected.
Genomic Southern blotting was performed on DNA restricted with EcoRI and
BamHI. The E4 fragment was then detected at approximately one copy/genome
compared to standards with the EcoRI/BamHI E4 fragment as cloned into pBS/E4
for use as a labeled probe with the Genius system according to manufacturer's
instructions (Boehringer Mannheim, Indianapolis, IN). In DNA from the 211 cell
line, the labeled internal fragment of pE4/Hygro hybridized with the isolated E4
sequences. In addition, the probe hybridized to a larger fragment which may be
the result of a second insertion event.

The 211 cell line was not selected by neomycin resistance, which indicates
the absence of the fiber gene. To confirm the lack of fiber gene, the 211
cell line was analyzed for expression of fiber protein by indirect
immunofluorescence with an anti-fiber polyclonal antibody and a FITC-labeled
anti-rabbit IgG (KPL) as secondary. No immunoreactivity was detected.
Therefore, to generate 211 clones containing recombinant fiber genes, the 211
clone was expanded by growing in RPMI medium and subjected to additional
electroporation with the fiber-encoding pCLF plasmid as described above.

Following electroporation, cells were plated in DMEM + 10% fetal calf
serum and colonies were selected with 200 µg/ml G418 (Gibco, Gaithersburg,
MD). Positive cell lines remained hygromycin resistant. These candidate
sublines of 211 were then screened for fiber protein expression by indirect
immunofluorescence as described above. The three sublines screened, 211A,
211B and 211R, along with a number of other sublines, all exhibited nuclear
staining qualitatively comparable to the positive control of 293 cells infected
with AdRSVβgal (1 pfu/cell) and stained 24 hours post-infection.

Lines positive for nuclear staining in this assay were then subjected to
Western blot analysis under denaturing conditions using the same antibody.
Several lines in which the antibody detected a protein of the predicted molecular
weight (62 kd for the Ad5 fiber protein) were selected for further study, including
211A, 211B and 211R. The 211A cell line has been deposited with ATCC as
described in Example 4.

Immunoprecipitation analysis using soluble nuclear extracts from these
three cell lines and a seminative electrophoresis system demonstrated that the
fiber protein expressed is in the functional trimeric form characteristic of the
native fiber protein. The predicted molecular weight of a trimerized fiber is 186
kd. Under denaturing conditions, the trimeric form was destroyed resulting in
detectable fiber monomers. Those clones containing endogenous E1, newly
expressed recombinant E4 and fiber proteins were selected for use in
complementing adenovirus gene delivery vectors having the corresponding
adenoviral genes deleted as described in Example 2.

D. Preparation of an E1 -Expressing Plasmid for Complementation of 

E1 -Gene-Deleted Adenoviruses 
In order to prepare adenoviral packaging cell lines other than those based 
15 on the E1-gene containing 293 cell line as described in Example 1C above, 

plasmid vectors containing E1 alone or in various combinations with E4 and fiber 
genes are constructed as described below. 

The region of the adenovirus genome containing the E1a and E1b genes is
amplified from viral genomic DNA by PCR as previously described. The primers
used are E1L, the 5' or forward primer, and E1R, the 3' or backward primer,
having the respective nucleotide sequences 5'CCGAGCTAGCGACTGAAAATGAG3'
(SEQ ID NO: 10) and 5'CCTCTCGAGAGACAGCAAGACAC3' (SEQ ID NO: 11).
The E1L and E1R primers include the respective restriction sites NheI (GCTAGC)
and XhoI (CTCGAG). The sites are used to clone the amplified E1 gene fragment
into the NheI/XhoI sites in pMAM, commercially available from Clontech
(Palo Alto, CA), to form the plasmid pDEX/E1 having 11152 bp.
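
As an illustrative check only, and not part of the procedure described here, the
presence of the NheI and XhoI recognition sequences in the E1L and E1R primers
can be confirmed with a short script. The primer strings are those given for
SEQ ID NO: 10 and 11 with spacing removed; the function name and data layout
are hypothetical.

```python
# Recognition sequences of the two enzymes named in the text.
SITES = {"NheI": "GCTAGC", "XhoI": "CTCGAG"}

# Primer sequences as given for SEQ ID NO: 10 and 11 (spaces removed).
PRIMERS = {
    "E1L": "CCGAGCTAGCGACTGAAAATGAG",
    "E1R": "CCTCTCGAGAGACAGCAAGACAC",
}


def find_sites(seq, sites=SITES):
    """Return {enzyme: 0-based offset} for each recognition site found in seq."""
    seq = seq.upper()
    return {name: seq.find(site) for name, site in sites.items() if site in seq}


for name, seq in PRIMERS.items():
    # Report which site each primer carries and where it starts.
    print(name, find_sites(seq))
```

Each primer carries exactly one of the two sites near its 5' end, which is what
permits directional cloning of the amplified fragment.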

The complete nucleotide sequence of pDEX/E1 is listed in SEQ ID NO: 12,
where nucleotide position 1 corresponds to approximately 1454 nucleotides
from the 3' end of the pMAM backbone vector sequence. The pDEX/E1 plasmid
includes nucleotides 552 to 4090 of the adenovirus genome positioned
downstream (beginning at nucleotide position 1460 and ending at 4998 in the
pDEX/E1 plasmid) of the glucocorticoid-inducible mouse mammary tumor virus
(MMTV) promoter of pMAM. The pMAM vector contains the E. coli gpt gene
that allows stable transfectants to be isolated using
hypoxanthine/aminopterin/thymidine (HAT) selection. The pMAM backbone
occupies nucleotide positions 1-1454 and 5005-11152 of SEQ ID NO: 12.
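
The coordinates stated for pDEX/E1 can be cross-checked arithmetically. The
following sketch assumes all positions are 1-based and inclusive (an assumption,
since the text does not state the convention); the variable names are
illustrative only.

```python
def span_len(start, end):
    """Length of a closed, 1-based interval [start, end]."""
    return end - start + 1


PLASMID_SIZE = 11152                         # stated size of pDEX/E1
AD5_GENOME_SPAN = (552, 4090)                # Ad5 nucleotides carried by the insert
INSERT_SPAN = (1460, 4998)                   # position of the insert in pDEX/E1
BACKBONE_SPANS = [(1, 1454), (5005, 11152)]  # pMAM backbone positions

insert_len = span_len(*INSERT_SPAN)
backbone_len = sum(span_len(s, e) for s, e in BACKBONE_SPANS)
# Positions assigned neither to the insert nor to the backbone.
junction_len = PLASMID_SIZE - insert_len - backbone_len

print("genome span length:", span_len(*AD5_GENOME_SPAN))
print("insert length:", insert_len)
print("backbone length:", backbone_len)
print("unassigned junction positions:", junction_len)
```

Under this reading, the Ad5 span and the insert span have the same length, and
the few positions left unassigned (1455-1459 and 4999-5004) would correspond to
sequence at the cloning junctions.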

E. Generation of an Adenovirus Packaging Cell Line Carrying Plasmids
Encoding Functional E1 and Fiber Proteins

To create separate adenovirus packaging cell lines equivalent to the
211 sublines, 211A, 211B and 211R, as described in Example 1C,
alternative cell lines lacking adenoviral genomes are selected for transfection
with the plasmid constructs as described below. Acceptable host cells include
A549, HeLa, Vero and like cell lines as described in Example 1. The selected
cell line is transfected with the separate plasmids pDEX/E1 and pCLF,
for expressing E1 and fiber complementary proteins, respectively. Following
transfection procedures as previously described, clones containing stable
insertions of the two plasmids are isolated by selection with neomycin and HAT.
Integration of a full-length copy of the E1 gene is assessed by PCR amplification
from genomic DNA using the primer set E1L/E1R, as described above.
Functional insertion of the fiber gene is assayed by staining with the anti-fiber
antibody as previously described.

The resultant stably integrated cell line is then used as a packaging cell
system to complement adenoviral gene delivery vectors having the
corresponding adenoviral gene deletions as described in Example 2.

F. Preparation of a Plasmid Containing Two or More Adenoviral Genes for
Complementing Gene-Deleted Adenoviruses

The methods described in the preceding Examples rely on the use of two
plasmids, pE4/Hygro and pCLF, or pCLF and pDEX/E1, for generating adenoviral
cell packaging systems. In alternative embodiments, complementing plasmids
containing two or more adenoviral genes for expression of the encoded proteins in
various combinations are also prepared as described below. The resultant
plasmids are then used in various cell systems with delivery plasmids having the
corresponding adenoviral gene deletions. The selection of packaging cell,
content of the delivery plasmids and content of the complementing plasmids for
use in generating recombinant adenovirus viral vectors thus depends on whether
other adenoviral genes are deleted along with the adenoviral fiber gene, and, if
so, which ones.

1. Preparation of a Complementing Plasmid Containing Fiber
and E1 Adenoviral Genes

A DNA fragment containing sequences for the CMV promoter, adenovirus
tripartite leader, fiber gene and bovine growth hormone terminator is amplified
from pCLF prepared in Example 1B using the forward primer
5'GACGGATCGGGAGATCTCC3' (SEQ ID NO: 13), which anneals to
nucleotides 1-19 of the pCDNA3 vector backbone in pCLF, and the backward
primer 5'CCGCCTCAGAAGCCATAGAGCC3' (SEQ ID NO: 14), which anneals to
nucleotides 1278-1257 of the pCDNA3 vector backbone. The fragment is
amplified as previously described and then cloned into the pDEX/E1 plasmid
prepared in Example 1D. For cloning in the DNA fragment, the pDEX/E1 vector
is first digested with NdeI, which cuts at a unique site in the pMAM vector
backbone in pDEX/E1, then the ends are repaired by treatment with
bacteriophage T4 polymerase and dNTPs.
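
As a simple consistency check, the length of each primer can be compared with
the number of backbone positions it is stated to anneal to. This is a sketch
only; the dictionary layout and function name are hypothetical, and the
coordinates are taken as 1-based and inclusive.

```python
# name: (sequence, first annealing position, last annealing position)
PRIMERS = {
    "SEQ ID NO: 13 (forward)": ("GACGGATCGGGAGATCTCC", 1, 19),
    "SEQ ID NO: 14 (backward)": ("CCGCCTCAGAAGCCATAGAGCC", 1278, 1257),
}


def footprint(first, last):
    """Number of template positions covered, regardless of strand direction."""
    return abs(last - first) + 1


for name, (seq, first, last) in PRIMERS.items():
    # A primer that anneals along its full length covers len(seq) positions.
    status = "consistent" if len(seq) == footprint(first, last) else "mismatch"
    print(name, len(seq), footprint(first, last), status)
```

The descending coordinates given for the backward primer (1278-1257) reflect
that it anneals to the opposite strand.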

The resulting plasmid containing E1 and fiber genes, designated
pE1/Fiber, provides dexamethasone-inducible E1 function as described for
pDEX/E1 and expression of Ad5 fiber protein as described above.

The complete nucleotide sequence of pE1/Fiber is listed in SEQ ID NO: 15,
where nucleotide position 1 corresponds to approximately 1459 nucleotides
from the 3' end of the parent vector pMAM sequence. The 5' and 3' ends of
the Ad5 E1 gene are located at respective nucleotide positions 1460 and 4998,
followed by pMAM backbone and then separated from the Ad5 fiber from pCLF
by the filled-in blunt-ended NdeI site. The 5' and 3' ends of the pCLF fiber gene
fragment are located at respective nucleotide positions 10922 and 14223, containing
elements as previously described for pCLF.

The resultant pE1/Fiber plasmid is then used to complement one or more
delivery plasmids lacking E1 and fiber.

The pE1/Fiber construct is then used to transfect a selected host cell as
described in Example 1E to generate stable chromosomal insertions, performed as
previously described, followed by selection on HAT medium. The stable cells are
then used as packaging cells as described in Example 2.

2. Preparation of a Complementing Plasmid Containing E4 and
Fiber Adenoviral Genes

Plasmid pCLF prepared as described in Example 1B is partially digested
with BglII to cut only at the site in the pCDNA3 backbone. The pE4/Hygro
plasmid prepared in Example 1A is digested with BamHI to produce a fragment
containing E4. The E4 fragment is then inserted into the BamHI site of pCLF to
form plasmid pE4/Fiber. The resultant plasmid provides expression of the fiber
gene as described for pCLF and E4 function as described for pE4/Hygro.

Plasmid pE4/Fiber has 10610 bp. The complete
nucleotide sequence of pE4/Fiber is listed in SEQ ID NO: 16, where
nucleotide position 1 corresponds to approximately 14 bp from the 3' end of the
parent vector pCDNA3 backbone sequence. The 5' and 3' ends of the Ad5 E4
gene are located at respective nucleotide positions 21 and 3149, followed by
fused BglII/BamHI sites and pCDNA3 backbone including the CMV promoter,
again followed by BglII/BamHI sites. The adenovirus leader sequence begins at
nucleotide position 4051 and extends to 4366, followed by fused BamHI/BglII
sites and the 5' and 3' ends of the fiber gene located at respective nucleotide
positions 4372 and 6124.

Stable chromosomal insertions of pE4/Fiber in host cells are obtained as
described above.

EXAMPLE 2

Preparation of Adenoviral Gene Delivery Vectors Using Adenoviral Packaging Cell
Lines

Adenoviral delivery vectors are prepared to separately lack the
combinations of E1/fiber and E4/fiber. Such vectors are more
replication-defective than those previously in use due to the absence of multiple
viral genes. A preferred adenoviral delivery vector, which is replication
competent but only via a non-fiber means, is one that lacks only the fiber gene
but contains the remaining functional adenoviral regulatory and structural genes.
Furthermore, these adenovirus delivery vectors have a higher capacity for
insertion of foreign DNA.

A. Preparation of Adenoviral Gene Delivery Vectors Having Specific Gene
Deletions and Methods of Use

To construct the E1/fiber-deleted viral vector containing the LacZ
reporter gene construct, two new plasmids were constructed. The plasmid
pΔE1Bβgal was constructed as follows. A DNA fragment containing the SV40
regulatory sequences and E. coli β-galactosidase gene was isolated from pSVβgal
(Promega) by digesting with VspI, filling the overhanging ends by treatment with
Klenow fragment of DNA polymerase I in the presence of dNTPs and digesting
with BamHI. The resulting fragment was cloned into the EcoRV and BamHI
sites in the polylinker of pΔE1sp1B (Microbix Biosystems, Hamilton, Ontario) to
form pΔE1Bβgal, which therefore contained the left end of the adenovirus genome
with the E1a region replaced by the LacZ cassette (nucleotides 6690 to 4151) of
pSVβgal. Plasmid DNA may be prepared by the alkaline lysis method as
described by Birnboim and Doly, Nuc. Acids Res., 7:1513-1523 (1979) or by the
Qiagen method according to the manufacturer's instructions, from transformed
cells used to expand the plasmid. DNA was then purified by CsCl-ethidium
bromide density gradient centrifugation. Alternatively, plasmid DNAs may be
purified from E. coli by standard methods known in the art (e.g., see Sambrook et
al.).

The second plasmid (pDV44), prepared as described herein, is derived
from pBHG10, a vector prepared as described by Bett et al., Proc. Natl. Acad.
Sci. USA, 91:8802-8806 (1994) (see also International PCT application No.
WO 95/00655) using methods well known to one of skill in the art. This vector
is also commercially available from Microbix and contains an Ad5 genome
with the packaging signals at the left end deleted and the E3 region (nucleotides
28133:30818) replaced by a linker with a unique site for the restriction enzyme
PacI. An 11.9 kb BamHI fragment, which contains the right end of the
adenovirus genome, is isolated from pBHG10 and cloned into the BamHI site of
pBS/SK(+) to create plasmid p11.3 having approximately 14,658 bp. The p11.3
plasmid was then digested with PacI and SalI to remove the fiber, E4, and
inverted terminal repeat (ITR) sequences.

This fragment was replaced with a 3.4 kb fragment containing the ITR
segments and the E4 gene, which was generated by PCR amplification from
pBHG10 using the following oligonucleotide sequences:
5'TGTACACCGGATCCGGCGCACACC3' (SEQ ID NO: 17); and
5'CACAACGAGCTCAATTAATTAATTGCCACATCCTC3' (SEQ ID NO: 18).

These primers incorporated sites for PacI and BamHI. Cloning this fragment into
the PacI and blunt-ended SalI sites of the p11.3 backbone resulted in a
substitution of the fused ITRs, E4 region and fiber gene present in pBHG10 by
the ITRs and E4 region alone. The resulting p11.3 plasmid containing the ITR
and E4 regions, designated plasmid pDV43a, was then digested with BamHI.
This BamHI fragment was then used to replace a BamHI fragment in pBHG10,
thereby creating pDV44 in a pBHG10 backbone.
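
For illustration, the following sketch identifies which of the two
oligonucleotides carries the BamHI site and which carries the PacI site. The
primer strings are those of SEQ ID NO: 17 and 18 with spacing removed; the
helper name is hypothetical and the check is not part of the described cloning
procedure.

```python
# Recognition sequences of the enzymes named in the text.
SITES = {"BamHI": "GGATCC", "PacI": "TTAATTAA"}

PRIMERS = {
    "SEQ ID NO: 17": "TGTACACCGGATCCGGCGCACACC",
    "SEQ ID NO: 18": "CACAACGAGCTCAATTAATTAATTGCCACATCCTC",
}


def sites_in(seq):
    """Return the sorted names of all recognition sites present in seq."""
    seq = seq.upper()
    return sorted(name for name, site in SITES.items() if site in seq)


for name, seq in PRIMERS.items():
    print(name, sites_in(seq))
```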

In an alternative approach to preparing pDV44 with an additional
subcloning step to facilitate the incorporation of restriction cloning sites, the
following cloning procedure was performed. pDV44 as above was constructed
by removing the fiber gene and some of the residual E3 sequences from pBHG10
(Microbix Biosystems). As above, to simplify manipulations, the 11.9 kb BamHI
fragment including the rightmost part of the Ad5 genome was removed from
pBHG10 and inserted into pBS/SK. The resulting plasmid was termed p11.3.
The 3.4 kb DNA fragment corresponding to the E4 region and both ITRs of
adenovirus type 5 was amplified as described above from pBHG10 using the
oligonucleotides listed above and subcloned into the vector pCR2.1 (Invitrogen)
to create pDV42. This step is the additional cloning step to facilitate the
incorporation of a SalI restriction site. pDV42 was then digested with PacI,
which cuts at a unique site (bold type) in one of the PCR primers, and with SalI,
which cuts at a unique site in the pCR2.1 polylinker. This fragment was used to
replace the corresponding PacI/XhoI fragment of p11.3 (the pBS polylinker
adjacent to the Ad DNA fragment contains a unique XhoI site), creating pDV43.

A plasmid designated pDV44 was constructed by replacing the 11.9 kb
BamHI fragment of pBHG10 by the analogous BamHI fragment of pDV43. As
generated in the first procedure, pDV44 therefore differs from pBHG10 by the
deletion of Ad5 nucleotides 30819:32743 (residual E3 sequences and all but the
3'-most 41 nucleotides of the fiber open reading frame).
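
The sizes of the deletions can be worked out from the stated Ad5 coordinates.
The sketch below assumes the colon-separated ranges are 1-based and inclusive
(an assumption); the names are illustrative only.

```python
E3_DELETION = (28133, 30818)     # replaced by the PacI linker in pBHG10
FIBER_DELETION = (30819, 32743)  # additional deletion that distinguishes pDV44


def length(interval):
    """Number of nucleotides in a closed, 1-based interval."""
    start, end = interval
    return end - start + 1


def adjacent(first, second):
    """True if 'second' begins at the nucleotide immediately after 'first'."""
    return first[1] + 1 == second[0]


print("E3 deletion:", length(E3_DELETION), "nt")
print("additional pDV44 deletion:", length(FIBER_DELETION), "nt")
print("deletions contiguous:", adjacent(E3_DELETION, FIBER_DELETION))
print("combined:", length(E3_DELETION) + length(FIBER_DELETION), "nt")
```

On these coordinates the additional pDV44 deletion begins immediately after
the E3 deletion already present in pBHG10, so the two together remove one
continuous stretch of the Ad5 genome.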

Thus, to summarize, the cloning procedures described above result in the
production of a fiber-deleted Ad5 genomic plasmid (pDV44) that was
constructed by removing the fiber gene and some of the residual E3 sequences
from pBHG10. pDV44 contains a wild-type E4 region, but only the last 41
nucleotides of the fiber ORF (this sequence was retained to avoid affecting
expression of the adjacent E4 transcription unit). Plasmids pBHG10 and pDV44
contain unpackageable Ad5 genomes, and must be rescued by cotransfection
and subsequent homologous recombination with DNA carrying functional
packaging signals. In order to generate vectors marked with a reporter gene,
either pDV44 or pBHG10 was cotransfected with pΔE1Bβgal, which contains
the left end of the Ad5 genome with an SV40-driven β-galactosidase reporter
gene inserted in place of the E1 region.

In general, and as described below, the method for virus production by
recombination of plasmids followed by complementation in cell culture involves
the isolation of recombinant viruses by cotransfection of any one of the
adenovirus packaging cell systems prepared in Example 1, namely 211A, 211B,
211R, A549, Vero cells, and the like, with plasmids carrying sequences
corresponding to viral gene delivery vectors.

A selected cell line is plated in dishes and cotransfected with pDV44 and
pΔE1Bβgal using the calcium phosphate method as described by Bett et al.,
Proc. Natl. Acad. Sci. USA, 91:8802-8806 (1994). Recombination between the
overlapping adenovirus sequences in the two plasmids leads to the creation of a
full-length viral chromosome where pDV44 and pΔE1Bβgal recombine to form a
recombinant adenovirus vector having multiple deletions. The deletion of E1 and
of the fiber gene from the viral chromosome is compensated for by the
sequences integrated into the packaging cell genome, and infectious virus
particles are produced. The plaques thus generated are isolated and stocks of
the recombinant virus are produced by standard methods.

Because of the fiber deletion, a pDV44-derived virus is
replication-defective, and cells in which it is grown must complement this defect.
The 211B cell line (a derivative of 293 cells which expresses the wild-type (wt)
Ad5 fiber and is equivalent to 211A on deposit with ATCC as described in
Example 4) was used for rescue and propagation of the virus described here.
pDV44 and pΔE1Bβgal were cotransfected into 211B cells, and the monolayers
were observed for evidence of cytopathic effect (CPE). Briefly, for virus
construction, cells were transfected with the indicated plasmids using the Gibco
Calcium Phosphate Transfection system according to the manufacturer's
instructions and observed daily for evidence of CPE.

One of a total of 58 transfected dishes showed evidence of spreading cell
death at day 15. A crude freeze-thaw lysate was prepared from these cells and
the resulting virus (termed Ad5.βgal.ΔF) was plaque purified twice and then
expanded. To prepare purified viral preparations, cells were infected with the
indicated Ad and observed for completion of CPE. Briefly, at day zero, 211B
cells were plated in DMEM plus 10% fetal calf serum at approximately 1 x 10⁷
cells/150 cm² flask or equivalent density. At day one, the medium was replaced
with one half the original volume of fresh DMEM containing the indicated Ad, in
this case Ad5.βgal.ΔF, at approximately 100 particles/cell. At day two, an equal
volume of medium was added to each flask and the cells were observed for CPE.
Two to five days after infection, cells were collected and virus isolated by lysis
via four rapid freeze-thaw cycles. Virus was then purified by centrifugation on
preformed 15-40% CsCl gradients (111,000 x g for three hours at 4°C). The
bands were harvested, dialyzed into storage buffer (10 mM Tris, pH 8.1, 0.9%
NaCl, and 10% glycerol), aliquoted and stored at -70°C. Purified Ad5.βgal.ΔF
virus particles containing the human adenovirus Ad5.βgal.ΔF genome (described
further below) have been deposited with the ATCC on January 15, 1999 as
further described in Example 4.

For viral titering, as necessary in the Examples below, Ad preparations
were titered by plaque assay on 211B cells. Cells were plated on
polylysine-coated 6-well plates at 1.5 x 10⁶ cells/well. Duplicate dilutions of
virus stock were added to the plates in 1 ml/well of complete DMEM. After a
five hour incubation at 37°C, virus was removed and the wells overlaid with 2
ml of 0.6% low-melting agarose in Medium 199 (Gibco). An additional 1 ml of
overlay was added at five day intervals.
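
A titer is conventionally derived from such an assay by averaging the plaque
counts of the duplicate wells and correcting for dilution and inoculum volume.
The following is a minimal sketch of that standard calculation; the plaque
counts and dilution factor shown are hypothetical values chosen for
illustration and are not data from the experiments described here.

```python
def titer_pfu_per_ml(plaque_counts, dilution_factor, inoculum_ml=1.0):
    """Plaque-forming units per ml of undiluted stock.

    plaque_counts   -- plaques counted in replicate wells at one dilution
    dilution_factor -- fold dilution of the stock (e.g. 10**6 for a 1:10^6 dilution)
    inoculum_ml     -- volume of diluted virus added per well (1 ml/well in the text)
    """
    if not plaque_counts:
        raise ValueError("at least one plaque count is required")
    mean_plaques = sum(plaque_counts) / len(plaque_counts)
    return mean_plaques * dilution_factor / inoculum_ml


# Hypothetical duplicate wells at a 1:10^6 dilution.
example = titer_pfu_per_ml([42, 38], dilution_factor=10**6)
print(f"{example:.2e} PFU/ml")
```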

As a control, the first-generation virus Ad5.βgal.wt, which is identical to
Ad5.βgal.ΔF except for the fiber deletion, was constructed by cotransfection of
pBHG10 and pΔE1Bβgal. In contrast to the low efficiency of recovery of the
fiberless genome (1/58 dishes), all of 9 dishes cotransfected with pΔE1Bβgal
and pBHG10 produced virus.

In another embodiment, a delivery plasmid is prepared that does not
require the above-described recombination events to prepare a viral vector
having a fiber gene deletion. In one embodiment, a single delivery plasmid
containing all the adenoviral genome necessary for packaging but lacking the
fiber gene is prepared from plasmid pFG140 containing full-length Ad5 that is
commercially available from Microbix. The resultant delivery plasmid, referred to
as pFG140-f, is then used with pCLF stably integrated cells as described above to
prepare a viral vector lacking fiber. For genetic therapy, the fiber gene can be
replaced with a therapeutic gene of interest for preparing a therapeutic delivery
adenoviral vector. Methods for producing a fiberless vector with a complete TPL
are described in Example 3.

Vectors for the delivery of any desired gene, and preferably a therapeutic
gene, are prepared by cloning the gene of interest into the multiple cloning sites
in the polylinker of commercially available pΔE1sp1B (Microbix Biosystems), in
an analogous manner as performed for preparing pΔE1Bβgal as described above.
The same cotransfection and recombination procedure is then followed as
described herein to obtain viral gene delivery vectors as further discussed in later
Examples.

1. Characterization of the Ad5.βgal.ΔF Genome

To confirm that the vector genomes had the proper structures and that
the fiber gene was absent from the Ad5.βgal.ΔF chromosome, the DNA isolated
from viral particles was analyzed. Briefly, purified viral DNA was obtained by
adding 10 μl of 10 mg/ml proteinase K, 40 μl of 0.5 M EDTA and 50 μl of 10%
SDS to 800 μl of adenovirus-containing culture supernatant. The suspension
was then incubated at 55°C for 60 minutes. The solution was then extracted
once with 400 μl of a 24:1 mixture of chloroform:isoamyl alcohol. The aqueous
phase was then removed and precipitated with sodium acetate/ethanol. The pellet
was washed once with 70% ethanol and lightly dried. The pellet was then
suspended in 40 μl of 10 mM Tris-HCl, pH 8.0, 1 mM EDTA. Genomic DNA
from Ad5.βgal.wt and Ad5.βgal.ΔF produced the expected restriction patterns
following digestion with either EcoRI or with NdeI. Southern blotting, performed
with standard methods, with labeled fiber DNA as a probe demonstrated the
presence of fiber sequence in Ad5.βgal.wt but not in Ad5.βgal.ΔF DNA. As a
positive control, the blot was stripped and reprobed with labeled E4 sequence.
Fiber and E4 sequences were detected by using labeled inserts from pCLF and
pE4/Hygro, respectively. E4 signal was readily detectable in both genomes at
equal intensities. The complete nucleotide sequence of Ad5.βgal.ΔF is presented
in SEQ ID NO: 23 and is contained in the virus particle on deposit with ATCC.

2. Characterization of the Fiberless Adenovirus Ad5.βgal.ΔF

To verify that Ad5.βgal.ΔF was fiber-defective, 293 cells (which are
permissive for growth of E1-deleted Ad vectors but do not express fiber) were
infected with Ad5.βgal.ΔF or with Ad5.βgal.wt. Twenty-four hours
post-infection, the cells were stained with polyclonal antibodies directed either
against fiber or against the penton base protein. Cells infected with either virus
were stained by the anti-penton base antibody, while only cells infected with the
Ad5.βgal.wt control virus reacted with the anti-fiber antibody. This confirms
that the fiber-deleted Ad mutant does not direct the synthesis of fiber protein.

3. Growth of the Fiber-Deleted Ad5.βgal.ΔF Vector in
Complementing Cells

Ad5.βgal.ΔF was found to be readily propagated in 211B cells. As
assayed by protein concentration, CsCl-purified stocks of either Ad5.βgal.ΔF or
Ad5.βgal.wt contained similar numbers of viral particles. The particles appeared
to band normally on CsCl gradients. Infectivity of the Ad5.βgal.ΔF particles was
lower than that of the Ad5.βgal.wt control, as indicated by an increased
particle/PFU ratio. Ad5.βgal.ΔF was also found to plaque more slowly than the
control virus. When plated on 211B cells, Ad5.βgal.wt plaques appeared within 5-7
days, while plaques of Ad5.βgal.ΔF continued to appear until as much as 15-18
days post-infection. Despite their slower formation, the morphology of
Ad5.βgal.ΔF plaques was essentially normal.
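
The particle/PFU ratio referred to above is the number of physical particles
per infectious unit, so a higher ratio indicates lower specific infectivity.
The sketch below shows the calculation; all numerical inputs are hypothetical
placeholders rather than measured values from these experiments.

```python
def particle_to_pfu_ratio(particles_per_ml, pfu_per_ml):
    """Physical particles per plaque-forming unit for one virus stock."""
    if pfu_per_ml <= 0:
        raise ValueError("pfu_per_ml must be positive")
    return particles_per_ml / pfu_per_ml


def fold_reduction(ratio_test, ratio_control):
    """How many fold less infectious, per particle, the test stock is."""
    return ratio_test / ratio_control


# Hypothetical stocks with equal particle counts but different plaque titers.
control_ratio = particle_to_pfu_ratio(10**12, 10**10)
fiberless_ratio = particle_to_pfu_ratio(10**12, 10**8)
print("control particle/PFU:", control_ratio)
print("test particle/PFU:", fiberless_ratio)
print("fold reduction in infectivity:", fold_reduction(fiberless_ratio, control_ratio))
```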

4. Production of Fiberless Ad5.βgal.ΔF Particles

As Ad5.βgal.ΔF represents a true fiber null mutation and its stocks are
free of helper virus, the fiber mutant phenotype was readily investigated. A
single round of growth in cells (such as 293) that do not produce fiber
generates a homogeneous preparation of fiberless Ad, which allowed for the
determination of whether such particles would be stable and/or infectious.
Either Ad5.βgal.wt or Ad5.βgal.ΔF was grown in 293 or 211B cells, and the
resulting particles were purified on CsCl gradients as previously described.
Ad5.βgal.ΔF particles were readily produced in 293 cells at approximately the
same level as the control virus and behaved similarly on the gradients, indicating
that there was not a gross defect in morphogenesis of fiberless capsids.

Particles of either virus contained similar amounts of penton base
regardless of the cell type in which they were grown. This demonstrated that
fiber is not required for assembly of the penton base complex into virions. The
Ad5.βgal.ΔF particles produced in 293 cells did not contain fiber protein.
211B-grown Ad5.βgal.ΔF also contained less fiber than the Ad5.βgal.wt control
virus. The infectivities of the different viral preparations on epithelial cells
correlated with the amount of fiber protein present. The fiberless Ad particles
were several thousand-fold less infectious than the first-generation vector control
on a per-particle basis, while infectivity of 211B-grown Ad5.βgal.ΔF was only
50-100 fold less than that of Ad5.βgal.wt. These studies confirmed fiber's
crucial role in infection of epithelial cells via CAR binding.

5. Composition and Structure of the Fiberless Ad5.βgal.ΔF
Particles

The proteins contained in particles of 293-grown Ad5.βgal.ΔF were
compared to those in Ad5.βgal.wt, to determine whether proteolysis or particle
assembly was defective in this fiber null mutant. The overall pattern of proteins
in the fiberless particles was observed to be quite similar to that of a
first-generation vector, with the exception of reduced intensity of the composite
band resulting from proteins IIIa and IV (fiber). The fiberless particles also had a
reduced level of protein VII. Although substantial amounts of uncleaved
precursors to proteins VI, VII, and VIII were not seen, it is possible that the
low-molecular weight bands migrating ahead of protein VII represent either
aberrantly cleaved viral proteins or their breakdown products.

Cryo-electron microscopy was used to more closely examine the structure
of the 293-grown Ad5.βgal.ΔF and of Ad5.βgal.wt. The fiber, having an
extended stalk with a knob at the end, was faintly visible in favorable
orientations of wild-type Ad5 particles, but not in images of the fiberless
particles. Filamentous material likely corresponding to free viral DNA was seen
in micrographs of fiberless particles. This material was also present in
micrographs of the first-generation control virus, albeit at much lower levels.

Three-dimensional image reconstructions of fiberless and wild-type
particles at ~20 Å resolution showed similar sizes and overall features, with the
exception that fiberless particles lacked density corresponding to the fiber
protein. The densities corresponding to other capsid proteins, including penton
base and proteins IIIa, VI, and IX, were comparable in the two structures. This
confirms that absence of fiber does not prevent assembly of these components
into virions. The fiber was truncated in the wild-type structure, as only the lower
portion of its flexible shaft follows icosahedral symmetry. The RGD protrusions
on the fiberless penton base were angled slightly inward relative to those of the
wild-type structure. Another difference between the two penton base proteins
was that there is a ~30 Å diameter depression in the fiberless penton base
around the five-fold axis where the fiber would normally sit. The Ad5
reconstructions confirm that capsid assembly, including addition of penton base
to the vertices, is able to proceed in the complete absence of fiber.

6. Integrin-Dependent Infectivity of Fiberless Ad5.βgal.ΔF
Particles

While attachment via the viral fiber protein is a critical step in the
infection of epithelial cells, an alternative pathway for infection of certain
hematopoietic cells has been described. In this case, penton base mediates
binding to the cells (via β2 integrins) and internalization (through interaction with
αv integrins). Particles lacking fiber might therefore be expected to be
competent for infection of these cells, even though on a per-particle basis they
are several thousand-fold less infectious than normal Ad vectors on epithelial
cells.

To investigate this, THP-1 monocytic cells were infected with
Ad5.βgal.wt or with Ad5.βgal.ΔF grown in the absence of fiber. Infection of
THP-1 cells was assayed by infecting 2 x 10⁵ cells at the indicated m.o.i. in 0.5
ml of complete RPMI. Forty-eight hours post-infection, the cells were fixed with
glutaraldehyde and stained with X-gal, and the percentage of stained cells was
determined by light microscopy. The results of the infection assay showed that
the fiberless particles were only a few-fold less infectious than first-generation
Ad on THP-1 cells. In contrast, large differences were seen in plaquing efficiency
on epithelial (211B) cells. Infection of THP-1 cells by either Ad5.βgal.ΔF or
Ad5.βgal.wt was not blocked by an excess of soluble recombinant fiber protein,
but could be inhibited by the addition of recombinant penton base. These
results indicate that the fiberless Ad particles use a fiber-independent pathway to
infect these cells. Furthermore, the lack of fiber protein did not prevent
Ad5.βgal.ΔF from internalizing into the cells and delivering its genome to the
nucleus, demonstrating that fiberless particles are properly assembled and are
capable of uncoating.

The foregoing results with the recombinant viruses thus produced
indicate that they can be used as gene delivery tools in cultured cells and in
vivo as described more fully in the Examples. For example, for studies of the
effectiveness and relative immunogenicity of multiply-deleted vectors, virus
particles are produced by growth in the packaging lines described in Example 1
and are purified by CsCl gradient centrifugation. Following titering, virus
particles are administered to mice via systemic or local injection or by aerosol
delivery to lung. The LacZ reporter gene allows the number and type of cells
which are successfully transduced to be evaluated. The duration of transgene
expression is evaluated in order to determine the long-term effectiveness of
treatment with multiply-deleted recombinant adenoviruses relative to the
standard technologies which have been used in clinical trials to date. The
immune response to the improved vectors described here is determined by
assessing parameters such as inflammation, production of cytotoxic T
lymphocytes directed against the vector, and the nature and magnitude of the
antibody response directed against viral proteins.

Versions of the vectors which contain therapeutic genes such as CFTR
for treatment of cystic fibrosis or tumor suppressor genes for cancer treatment
are evaluated in the animal system for safety and efficiency of gene transfer and
expression. Following this evaluation, they are used as experimental therapeutic
agents in human clinical trials.

B. Retargeting of Adenoviral Gene Delivery Vectors by Producing
Viral Particles Containing Different or Altered Fiber Proteins

As the specificity of adenovirus binding to target cells is largely
determined by the fiber protein, viral particles that incorporate modified fiber
proteins or fiber proteins from different adenoviral serotypes (pseudotyped
vectors) have different specificities. Thus, the methods of expression of the
native Ad5 fiber protein in adenovirus packaging cells as described above are also
applicable to production of different fiber proteins.

Chimeric fiber proteins can be produced according to known methods
(see, e.g., Stevenson et al. (1995) J. Virol., 69:2850-2857). Determinants for
fiber receptor binding activity are located in the head domain of the fiber, and an
isolated head domain is capable of trimerization and binding to cellular receptors.
The head domains of adenovirus type 3 (Ad3) and Ad5 were exchanged in order
to produce chimeric fiber proteins. Similar constructs for encoding chimeric fiber
proteins for use in the methods herein are contemplated. Thus, instead of
using the intact Ad5 fiber-encoding construct (prepared as described above and
in U.S. application Serial No. 09/482,682) as a complementing viral vector in
adenoviral packaging cells, the constructs described herein are used to transfect
cells along with E4 and/or E1-encoding constructs.

Briefly, full-length Ad5 and Ad3 fiber genes were amplified from purified
adenovirus genomic DNA as a template. The Ad5 and Ad3 nucleotide
sequences are available with the respective GenBank Accession Numbers
M18369 and M12411. Oligonucleotide primers are designed to amplify the
entire coding sequence of the full-length fiber genes, starting from the start
codon, ATG, and ending with the termination codon, TAA. For cloning purposes,
the 5' and 3' primers contain the respective restriction sites BamHI and NotI for
cloning into pcDNA plasmid as described in Example 1A. PCR is performed as
described above.

The resulting products are then used to construct chimeric fiber
constructs by PCR gene overlap extension (Horton et al. (1990) BioTechniques,
8:525-535). The Ad5 fiber tail and shaft regions (5TS; the nucleotide region
encoding amino acid residue positions 1 to 403) are connected to the Ad3 fiber
head region (3H; the nucleotide region encoding amino acid residue positions
136 to 319) to form the 5TS3H fiber chimera. Conversely, the Ad3 fiber tail and
shaft regions (3TS; the nucleotide region encoding amino acid residue positions
1 to 135) are connected to the Ad5 fiber head region (5H; the nucleotide region
encoding amino acid residue positions 404 to 581) to form the 3TS5H fiber
chimera. The fusions are made at the conserved TLWT (SEQ ID NO: 19)
sequence at the fiber shaft-head junction.
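The domain swap at the conserved junction can be illustrated at the protein-sequence level with a short Python sketch. This is an illustration only: the two input strings are made-up placeholders rather than the real Ad5 or Ad3 fiber sequences, the function name splice_at_junction is hypothetical, and the sketch assumes that the TLWT motif occurs exactly once in each parent and stays with the tail/shaft portion.

```python
JUNCTION = "TLWT"  # conserved motif at the fiber shaft-head junction


def splice_at_junction(tail_shaft_parent: str, head_parent: str,
                       motif: str = JUNCTION) -> str:
    """Join the tail+shaft (through the motif) of one fiber to the head of another."""
    for name, seq in (("tail/shaft", tail_shaft_parent), ("head", head_parent)):
        if seq.count(motif) != 1:
            raise ValueError(f"{name} parent must contain {motif} exactly once")
    ts_end = tail_shaft_parent.index(motif) + len(motif)   # keep motif with tail/shaft
    head_start = head_parent.index(motif) + len(motif)     # head begins after motif
    return tail_shaft_parent[:ts_end] + head_parent[head_start:]


# Placeholder sequences for illustration only (NOT real fiber sequences).
toy_ad5 = "MKRARPSGGGGTLWTPAPSPN"
toy_ad3 = "MAKRARLSNNNTLWTGVNPTE"

chimera_5TS3H = splice_at_junction(toy_ad5, toy_ad3)
chimera_3TS5H = splice_at_junction(toy_ad3, toy_ad5)
```

In the laboratory procedure the same join is made at the DNA level by overlap extension PCR; the sketch only shows which portion of each parent ends up in each chimera.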

The resultant chimeric fiber PCR products are then digested with BamHI
and NotI for separate directional ligation into a similarly digested pcDNA 3.1.
The TPL sequence is then subcloned into the BamHI site as described in
Example 1A for preparing an expression vector for subsequent transfection into
211 cells as described above or into the alternative packaging cell systems as
previously described. The resultant chimeric fiber construct-containing adenoviral
packaging cell lines are then used to complement adenoviral delivery vectors as
previously described. Other fiber chimeric constructs are obtained with the
various adenovirus serotypes using a similar approach.

In an alternative embodiment, modified fiber proteins, including those with
modified epitopes, are used (see, e.g., Michael et al. (1995) Gene Therapy,
2:660-668 and International PCT application Publication No. WO 95/26412,
which describe the construction of a cell-type specific therapeutic viral vector
having a new binding specificity incorporated into the virus concurrent with the
destruction of the endogenous viral binding specificity). In particular, the
authors described the production of an adenoviral vector encoding a gastrin
releasing peptide (GRP) at the 3' end of the coding sequence of the Ad5 fiber
gene. The resulting fiber-GRP fusion protein was expressed and shown to
assemble functional fiber trimers that were correctly transported to the nucleus
of HeLa cells following synthesis.

Similar constructs are contemplated for use in the complementing
adenoviral packaging cell systems for generating new adenoviral gene delivery
vectors that are targetable, replication-deficient and less immunogenic.
Heterologous ligands contemplated for use herein to redirect fiber specificity
range from as few as 10 amino acids in size to large globular structures, some of
which necessitate the addition of a spacer region so as to reduce or preclude
steric hindrance between the heterologous ligand and the fiber, or interference
with trimerization of the fiber protein. The ligands are inserted at the end of or
within the linker region. Preferred ligands include those that target specific cell
receptors or those that are used for coupling to other moieties such as biotin
and avidin.

A preferred spacer includes a short 12 amino acid peptide linker
composed of a series of serines and alanines flanked by a proline residue at each
end, prepared using routine procedures known to those of skill in the art. The
skilled artisan will be familiar with the preparation of linkers to accomplish
sufficient protein presentation and to alter the binding specificity of the fiber
protein without compromising the cellular events that follow viral internalization.
Moreover, within the context of this disclosure, preparation of modified fibers
having ligands positioned internally within the fiber protein and at the carboxy
terminus as described below is contemplated for use with the methods described
herein.

A fiber having a heterologous binding ligand is prepared
essentially as described in the above-cited paper. Briefly, for the ligand of
choice, site-directed mutagenesis is used to insert the coding sequence for a
linker into the 3' end of the Ad5 fiber construct in pCLF as prepared in
Example 1.

The 3' or antisense or mutagenic oligonucleotide encodes a preferred
linker sequence of ProSerAlaSerAlaSerAlaSerAlaProGlySer (SEQ ID NO: 20)
followed by a unique restriction site and two stop codons, respectively, to allow
the insertion of a coding sequence for a selected heterologous ligand and to
ensure proper translation termination. Flanking this linker sequence, the
mutagenic oligonucleotide contains sequences that overlap with the vector
sequence and allow its incorporation into the construct. Following mutagenesis
of the pCLF sequence adding the linker and stop codon sequences, a nucleotide
sequence encoding a preselected ligand is obtained, linkers corresponding to the
unique restriction site in the modified construct are attached and then the
sequence is cloned into the construct linearized at the corresponding restriction
site. The resultant fiber-ligand construct is then used to transfect 211 cells or
the alternative cell packaging systems previously described to produce
complementing viral vector packaging systems.
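The layout of the insert carried by the mutagenic oligonucleotide (linker coding sequence, then a restriction site, then two stop codons) can be sketched as follows. Several details are assumptions made only for illustration: the codon chosen for each amino acid is arbitrary, the EcoRI site is a placeholder because the text does not name the unique site, the particular stop codons are arbitrary, and the function name linker_cassette is hypothetical. The sketch shows the sense strand; the actual oligonucleotide is antisense and also carries flanking vector sequence.

```python
# Linker of SEQ ID NO: 20, written residue by residue.
LINKER_AA = ["Pro", "Ser", "Ala", "Ser", "Ala", "Ser",
             "Ala", "Ser", "Ala", "Pro", "Gly", "Ser"]

# One valid codon per residue; the specific choice is an assumption.
CODON_CHOICE = {"Pro": "CCA", "Ser": "TCT", "Ala": "GCT", "Gly": "GGT"}

# Two consecutive stop codons; which two are used is an assumption.
STOP_CODONS = ("TAA", "TGA")


def linker_cassette(restriction_site: str) -> str:
    """Return linker coding sequence + restriction site + two stop codons."""
    if len(restriction_site) % 3 != 0:
        # A whole number of codons keeps the downstream stops in frame.
        raise ValueError("restriction site length must be a multiple of 3")
    coding = "".join(CODON_CHOICE[aa] for aa in LINKER_AA)
    return coding + restriction_site + "".join(STOP_CODONS)


# EcoRI (GAATTC) is used purely as a placeholder site.
cassette = linker_cassette("GAATTC")
```

A ligand coding sequence cloned into the restriction site would need to be kept in frame with the linker so that translation runs through the ligand and terminates at the stop codons.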

In a further embodiment, intact fiber genes from different Ad serotypes
are expressed by 211 cells or an alternative packaging system as previously
described. A gene encoding the fiber protein of interest is first cloned to create
a plasmid analogous to pCLF, and stable cell lines producing the fiber protein are
generated as described above for Ad5 fiber. The adenovirus vector described
which lacks the fiber gene is then propagated in the cell line producing the fiber
protein relevant for the purpose at hand. As the only fiber gene present is the
one in the packaging cells, the adenoviruses produced contain only the fiber
protein of interest and therefore have the binding specificity conferred by the
complementing protein. Such viral particles are used in studies such as those
described above to determine their properties in experimental animal systems.

EXAMPLE 3 

Tripartite leader sequences (TPLs) that are useful in enhancing the
expression of complementing adenoviral proteins, particularly fiber protein, for
use in preparing an adenoviral gene delivery vector are provided. The complete
Ad5 TPL was constructed by assembling PCR fragments. First, the third TPL
exon (exon 3) (nt 9644-9731 of the Ad5 genome) was amplified from Ad5
genomic DNA using the synthetic oligonucleotide primers
5'CTCAACAATTGTGGATCCGTACTCC3' (SEQ ID No. 24) and
5'GTGCTCAGCAGATCTTGCGACTGTG3' (SEQ ID No. 25). The resulting
product was cloned to the BamHI and BglII sites of pΔE1Sp1a (Microbix
Biosystems) using sites in the primers (shown in bold) to create plasmid pDV52.
A fragment corresponding to the first TPL exon (exon 1), the natural first intron
(intron 1), and the second TPL exon (exon 2) (Ad5 nt 6049-7182) was then
amplified using primers 5'GGCGCGTTCGGATCCACTCTCTTCC3' (SEQ ID No.
26) and 5'CTACATGCTAGGCAGATCTCGTTCGGAG3' (SEQ ID No. 27), and
cloned into the BamHI site of pDV52 (again using sites in the primers) to create
pDV55.
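Because the bold type marking the restriction sites does not survive in plain text, the sites can be located in the four primers programmatically. The following Python sketch uses the primer sequences exactly as printed above and the standard BamHI (GGATCC) and BglII (AGATCT) recognition sequences; the function name find_sites is hypothetical.

```python
# Recognition sequences of the two enzymes used for cloning.
SITES = {"BamHI": "GGATCC", "BglII": "AGATCT"}

# Primers as printed in the text, keyed by their SEQ ID numbers.
PRIMERS = {
    "SEQ ID No. 24": "CTCAACAATTGTGGATCCGTACTCC",
    "SEQ ID No. 25": "GTGCTCAGCAGATCTTGCGACTGTG",
    "SEQ ID No. 26": "GGCGCGTTCGGATCCACTCTCTTCC",
    "SEQ ID No. 27": "CTACATGCTAGGCAGATCTCGTTCGGAG",
}


def find_sites(primer: str) -> dict:
    """Map each enzyme whose site occurs in the primer to its 0-based start index."""
    return {name: primer.find(seq) for name, seq in SITES.items() if seq in primer}


site_report = {seq_id: find_sites(primer) for seq_id, primer in PRIMERS.items()}
```

Each forward primer carries a BamHI site and each reverse primer a BglII site. BamHI and BglII produce compatible cohesive ends, which is consistent with the fused BamHI/BglII junction between exon 2 and exon 3 in the assembled leader.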

This plasmid contains a 1.2 kb BamHI/BglII fragment containing the first
TPL exon, the natural first intron, and the fused second and third TPL exons.
The nucleotide sequence of the complete TPL containing the noted 5' and 3'
restriction sites is shown in SEQ ID No. 28 with the following nucleotide regions
identified: 1-6 nt BamHI site; 7-47 nt first leader segment (exon 1); 48-1068 nt
natural first intron (intron 1); 1069-1140 nt second leader segment (exon 2);
1141-1146 nt fused BamHI and BglII sites; 1147-1234 nt third leader segment
(exon 3); and 1235-1240 nt BglII site.
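The coordinate map of the TPL cassette can be checked for internal consistency with a few lines of Python. The coordinates are those listed above for SEQ ID No. 28; the variable and function names are hypothetical and used only for this sketch.

```python
# (feature, first nt, last nt), 1-based inclusive, as listed for SEQ ID No. 28.
TPL_MAP = [
    ("BamHI site", 1, 6),
    ("exon 1", 7, 47),
    ("intron 1", 48, 1068),
    ("exon 2", 1069, 1140),
    ("fused BamHI/BglII site", 1141, 1146),
    ("exon 3", 1147, 1234),
    ("BglII site", 1235, 1240),
]


def segment_lengths(feature_map):
    """Length in nucleotides of each feature (inclusive coordinates)."""
    return {name: end - start + 1 for name, start, end in feature_map}


def is_contiguous(feature_map):
    """True if every feature starts one nucleotide after the previous one ends."""
    return all(nxt[1] == cur[2] + 1 for cur, nxt in zip(feature_map, feature_map[1:]))


lengths = segment_lengths(TPL_MAP)
total_length = sum(lengths.values())
leader_length = sum(lengths[name] for name in ("exon 1", "exon 2", "exon 3"))
```

The features tile the cassette without gaps, the total of 1240 nt agrees with the 1.2 kb BamHI/BglII fragment, and the 88 nt computed for exon 3 agrees with the genomic coordinates given for that exon (nt 9644-9731).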

EXAMPLE 4 

Deposit of Materials 

The following cell lines and plasmids were deposited on September 25,
1996, with the American Type Culture Collection, 10801 University Blvd,
Manassas, Virginia, USA (ATCC) under the provisions of the Budapest Treaty on
the International Recognition of the Deposit of Microorganisms for the Purpose
of Patent Procedure and the Regulations thereunder (Budapest Treaty): Plasmid
pE4/Hygro (accession number 97739), Plasmid pCLF (accession number 97737),
211 Cell Line (accession number CRL-12193) and 211A Cell Line (accession
number CRL-12194).

The following virus, Ad5.βgal.ΔF, was deposited on January 15, 1999,
with the ATCC as listed above and provided with accession number VR2636.

Additionally, plasmids pDV60, pDV67, pDV69, pDV80 and pDV90 were
deposited at the ATCC on January 5, 2000 and provided with accession
numbers PTA-1144, PTA-1145, PTA-1146, PTA-1147 and PTA-1148,
respectively.

EXAMPLE 5

Preparation and Use of Adenoviral Packaging Cell Lines 
Containing Plasmids Containing Alternative TPLs 

Plasmids containing tripartite leaders (TPLs) have been constructed. The
resulting plasmids, which contain different selectable markers, such as neomycin
and zeocin, were then used to prepare fiber-complementing stable cell lines for
use in preparing adenoviral vectors.

A. pDV60 

Plasmid pDV60 was constructed by inserting this TPL cassette of SEQ ID
No. 28 into the BamHI site upstream of the Ad5 fiber gene in pcDNA3/Fiber, a
neomycin selectable plasmid (see, e.g., U.S. application Serial No. 09/482,682
(also filed as International PCT application No. PCT/US00/00265 on January 14,
2000); see also Von Seggern et al. (1998) J. Gen. Virol., 79:1461-1468). The
nucleotide sequence of pDV60 is listed in SEQ ID NO: 29. Plasmid pDV60 has
been deposited in the ATCC under accession number PTA-1144.

B. pDV61 

To construct pDV61, an Asp718/NotI fragment containing the CMV
promoter, partial Ad5 TPL, wildtype Ad5 fiber gene, and bovine growth hormone
terminator was transferred from pCLF (ATCC accession number 97737; and
described in copending U.S. application Serial No. 09/482,682 (also filed as
International PCT application No. PCT/US00/00265 on January 14, 2000)) to a
zeocin selectable cloning vector referred to as pcDNA3.1/Zeo(+) (commercially
available from Invitrogen and for which the sequence is known).

C. pDV67

In an analogous process, pDV67 containing complete TPL was
constructed by transferring an Asp718/XbaI fragment from pDV60 into the
pcDNA3.1/Zeo(+) backbone. The nucleotide sequence of pDV67 is set forth in
SEQ ID No. 30. Plasmid pDV67 is available from the ATCC under accession
number PTA-1145.

D. pDV69

To prepare pDV69 containing a modified fiber protein, the chimeric
Ad3/Ad5 fiber gene was amplified from pGEM5TS3H (Stevenson et al. (1995) J.
Virol., 69:2850-2857) using the primers
5'ATGGGATCAAGATGAAGCGCGCAAGACCG3' (SEQ ID No. 31) and
5XACTATAGCGGCCGCATTCTCAGTCATCTT3' (SEQ ID No. 32), and cloned to
the BamHI and NotI sites of pcDNA3.1/Zeo(+) via new BamHI and NotI sites
engineered into the primers to create pDV68. Finally, the complete TPL
fragment described above was then added to the unique BamHI site of pDV68
to create pDV69. The nucleotide sequence of pDV69 is listed in SEQ ID No. 33
and the plasmid has been deposited in the ATCC under accession number
PTA-1146.

E. Preparation of Stable Adenovirus Packaging Cell Lines

E1-2a S8 cells are derivatives of the A549 lung carcinoma line (ATCC #
CCL 185) with chromosomal insertions of the plasmids pGRE5-2.E1 (also
referred to as GRE5-E1-SV40-Hygro construct and listed in SEQ ID No. 34) and
pMNeoE2a-3.1 (also referred to as MMTV-E2a-SV40-Neo construct and listed in
SEQ ID No. 35), which provide complementation of the adenoviral E1 and E2a
functions, respectively. This line and its derivatives were grown in Richter's
modified medium (BioWhittaker) + 10% FCS. E1-2a S8 cells were
electroporated as previously described (Von Seggern et al. (1998) J. Gen. Virol.,
79:1461-1468) with pDV61, pDV67, or with pDV69, and stable lines were
selected with zeocin (600 μg/ml).

The cell line generated with pDV61 is designated 601. The cell line
generated with pDV67 is designated 633 while that generated with pDV69 is
designated 644. Candidate clones were evaluated by immunofluorescent
staining with a polyclonal antibody raised against the Ad2 fiber. Lines
expressing the highest level of fiber protein were further characterized.

For the S8 cell complementing cell lines, to induce E1 expression, 0.3 μM
dexamethasone was added to cell cultures 16-24 hours prior to challenge
with virus for optimal growth kinetics. For preparing viral plaques, 5 x 10^5
cells/well in 6 well plates are prepared and pre-induced with the same
concentration of dexamethasone the day prior to infection, with dexamethasone
included at a final concentration of 0.5 μM in the agar overlay after infection.

F. Development of Cell Lines for Complementation of E1-/E2a- Vectors

The Adenovirus 5 genome was digested with ScaI enzyme, separated on
an agarose gel, and the 6,095 bp fragment containing the left end of the virus
genome was isolated. The complete Adenovirus 5 genome is registered as
Genbank accession #M73260, incorporated herein by reference, and the virus is
available from the American Type Culture Collection, Manassas, Virginia, U.S.A.,
under accession number VR-5. The ScaI 6,095 bp fragment was digested
further with ClaI at bp 917 and BglII at bp 3,328. The resulting 2,411 bp ClaI to
BglII fragment was purified from an agarose gel and ligated into the superlinker
shuttle plasmid pSE280 (Invitrogen, San Diego, CA), which was digested with
ClaI and BglII, to form pSE280-ΔE.

Polymerase chain reaction (PCR) was performed to synthesize DNA
encoding an XhoI and SalI restriction site contiguous with Adenovirus 5 DNA bp
552 through 924. The primers which were employed were as follows:
5' end, Ad5 bp 552-585:
5'-GTCACTCGAGGACTCGGTCGACTGAAAATGAGACATATTATCTGCCACGGACC-3'
(SEQ ID No. 36)
3' end, Ad5 bp 922-891:
5'-CGAGATCGATCACCTCCGGTACAAGGTTTGGCATAG-3' (SEQ ID No. 37)

This amplified DNA fragment (sometimes hereinafter referred to as
Fragment A) then was digested with XhoI and ClaI, which cleaves at the native
ClaI site (bp 917), and ligated to the XhoI and ClaI sites of pSE280-ΔE, thus
reconstituting the 5' end of the E1 region beginning 8 bp upstream of the ATG
codon.

PCR then was performed to amplify Adenovirus 5 DNA from bp 3,323
through 4,090 contiguous with an EcoRI restriction site. The primers which
were employed were as follows:
5' end, Ad5 bp 3323-3360:
5'-CATGAAGATCTGGAAGGTGCTGAGGTACGATGAGACC-3' (SEQ ID No. 38);
3' end, Ad5 bp 4090-4060:
5'-GCGACTTAAGCAGTCAGCTGAGACAGCAAGACACTTGCTTGATCCAAATCC-3'
(SEQ ID No. 39).

This amplified DNA fragment (sometimes hereinafter referred to as
Fragment B) was digested with BglII, thereby cutting at the Adenovirus 5 BglII
site (bp 3,328), and EcoRI, and ligated to the BglII and EcoRI sites of pSE280-ΔE
to reconstruct the complete E1a and E1b region from Adenovirus 5 bp 552
through 4,090. The resulting plasmid is designated pSE280-E1.

A construct containing the intact E1a/b region under the control of the
synthetic promoter GRE5 was prepared as follows. The intact E1a/b region was
excised from pSE280-E1, which was modified previously to contain a BamHI site
3' to the E1 gene, by digesting with XhoI and BamHI. The XhoI to BamHI
fragment containing the E1a/b region was cloned into the unique XhoI and
BamHI sites of pGRE5-2/EBV (U.S. Biochemicals, Cleveland, Ohio) to form
pGRE5-E1.

Bacterial transformants containing the final construct were identified. 
Plasmid DNA was prepared and purified by banding in CsTFA prior to use for 
transfection of cells. 

Construction of plasmid including Adenovirus 5 E2A sequence. 

The Adenovirus 5 genome was digested with BamHI and SpeI, which cut
at bp 21,562 and 27,080, respectively. Fragments were separated on an
agarose gel and the 5,518 bp BamHI to SpeI fragment was isolated. The 5,518
bp BamHI to SpeI fragment was digested further with SmaI, which cuts at bp
23,912. The resulting 2,350 bp BamHI to SmaI fragment was purified from an
agarose gel and ligated into the superlinker shuttle plasmid pSE280, which had
been digested with BamHI and SmaI, to form pSE280-E2 BamHI-SmaI.
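The fragment sizes quoted for the E1 and E2a subcloning steps follow directly from the cut positions given in the text, as the following Python sketch shows. Only positions stated in this example are used; the dictionary and function names are hypothetical, and the calculation simply takes the difference between two cut coordinates on the linear Ad5 genome.

```python
# Cut positions (bp) on the Ad5 genome as stated in the text.
AD5_CUT_SITES = {
    "ClaI": 917,
    "BglII": 3328,
    "BamHI": 21562,
    "SmaI": 23912,
    "SpeI": 27080,
}


def fragment_size(enzyme_a: str, enzyme_b: str) -> int:
    """Size in bp of the fragment between the cut sites of two enzymes."""
    return abs(AD5_CUT_SITES[enzyme_b] - AD5_CUT_SITES[enzyme_a])


e1_fragment = fragment_size("ClaI", "BglII")        # ClaI to BglII fragment
e2_large_fragment = fragment_size("BamHI", "SpeI")  # BamHI to SpeI fragment
e2_sub_fragment = fragment_size("BamHI", "SmaI")    # BamHI to SmaI fragment
```

The computed values match the 2,411 bp, 5,518 bp and 2,350 bp fragments described above, which is a convenient check when reading coordinates that may have been garbled in transcription.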

PCR then was performed to amplify Adenovirus 5 DNA from the SmaI site
at bp 23,912 through 24,730 contiguous with NheI and EcoRI restriction sites.
The primers which were employed were as follows:
5' end, Ad5 bp 24,732-24,708:
5'-CACGAATTCGTCAGCGCTTCTCGTCGCGTCCAAGACCC-3' (SEQ ID No. 40);
3' end, Ad5 bp 23,912-23,934:
5'-CACCCCGGGGAGGCGGCGGCGACGGGGACGGG-3' (SEQ ID No. 41)

This amplified DNA fragment was digested with SmaI and EcoRI, and
ligated to the SmaI and EcoRI sites of pSE280-E2 Bam-Sma to reconstruct the
complete E2a region from Ad5 bp 24,730 through 21,562. The resulting
construct is pSE280-E2a.

In order to convert the BamHI site at the 3' end of E2a to a SalI site, the
E2a region was excised from pSE280-E2a by cutting with BamHI and NheI, and
recloned into the unique BamHI and NheI sites of pSE280. Subsequently, the
E2a region was excised from this construction with NheI and SalI in order to
clone into the NheI and SalI sites of the pMAMneo (Clontech, Palo Alto, CA)
multiple cloning site in a 5' to 3' orientation, respectively. The resulting
construct is pMAMneo-E2a.

Bacterial transformants containing the final pMAMneo-E2a were
identified. Plasmid DNA was prepared and purified by banding in CsTFA.
Circular plasmid DNA was linearized at the XmnI site within the ampicillin
resistance gene of pMAMneo-E2a, and further purified by phenol/chloroform
extraction and ethanol precipitation prior to use for transfection of cells.

Transfection and selection of cells.

In general, this process involved the sequential introduction, by calcium 
phosphate precipitation, or other means of DNA delivery, of two plasmid 
constructions each with a different viral gene, into a single tissue culture cell. 
The cells were transfected with a first construct and selected for expression of 
the associated drug resistance gene to establish stable integrants. Individual cell 
clones were established and assayed for function of the introduced viral gene. 
Appropriate candidate clones then were transfected with a second construct 
including a second viral gene and a second selectable marker. Transfected cells 
then were selected to establish stable integrants of the second construct, and 
cell clones were established. Cell clones were assayed for functional expression 
of both viral genes.

A549 cells (ATCC Accession No. CCL-185) were used for transfection.
Appropriate selection conditions were established for G418 and hygromycin B by
standard kill curve determination.

Transfection of A549 cells with plasmids including E1 and E2a regions. 

pMAMNeo-E2a was linearized with XmnI within the AmpR gene, introduced
into cells by transfection, and cells were selected for stable integration of this
plasmid by G418 selection until drug resistant colonies arose. The clones were
isolated and screened for E2a expression by staining for E2a protein with a
polyclonal antiserum, and visualizing by immunofluorescence. E2a function was
screened by complementation of the temperature-sensitive mutant Ad5ts125
virus, which contains a temperature-sensitive mutation in the E2a gene (Van Der
Vliet, et al., J. Virology, Vol. 15, pgs. 348-354 (1975)). Positive clones
expressing the E2a gene were identified and used for transfection with the 7 kb
EcoRV to XmnI fragment from pGRE5-E1, which contains the GRE5 promoted
E1a/b region plus the hygromycin resistance gene. Cells were selected for
hygromycin resistance and assayed for E1a/b expression by staining with a
monoclonal antibody for the E1 protein (Oncogene Sciences, Uniondale, N.Y.).
E1 function was assayed by ability to complement an E1-deleted vector. At this
point, expression and function of E2a was verified as described above, thus
establishing the expression of E1a/b and E2a in the positive cell clones.

A transfected A549 (ATCC Accession No. CCL-185) cell line showed good
E1a/b and E2a expression and was selected for further characterization. It was
designated the S8 cell line.

G. Preparation of Adenoviral Vectors Containing Ad5.βgal.ΔF Genome in S8
Improved Fiber-Complementing Cell Lines

To prepare adenoviral vectors containing Ad5.βgal.ΔF (Ad5.βgal.ΔF has
been deposited with the ATCC under accession number VR2636) in S8 cells
containing alternative forms of TPL for enhancing the expression of fiber
proteins, the protocol as described in Example 2 for preparing Ad5.βgal.ΔF in
211B cells was followed with the exception of pretreatment with 0.3 μM
dexamethasone for 24 hours as described above. Thus, viral particles with the
wildtype Ad5 fiber protein on their surface and containing the fiberless
Ad5.βgal.ΔF genome were produced in 633 cells. Particles produced in 644
cells also contained the fiberless Ad5.βgal.ΔF genome, but had the chimeric
5T3H fiber protein, with the Ad3 fiber knob, on their surface.

Thus, these viral preparations, prepared as described herein, are useful for
targeting delivery of the Ad5.βgal.ΔF, Ad5.GFP.ΔF, or other similarly
constructed fiberless genome with either wild-type or modified fibers. Preferably
for purposes herein the fibers are from an Ad serotype D virus, more preferably
from Ad37.

EXAMPLE 6 

Pseudotyping and Infectivity of Recombinant Adenoviral Vectors
Produced with Improved Fiber-Complementing Cell Lines

A. Pseudotyping of Ad5.βgal.ΔF

To verify that the adenoviral vectors produced had altered tropisms,
viral particles were purified from either 633 or 644 cells and were then Western
blotted and probed with a polyclonal rabbit antibody against the Ad2 fiber
(which detects the Ad5 and chimeric 5T3H fiber proteins).

B. Infectivity of Cells with 633 or 644 Generated Virus Particles

The cell lines, 633 or 644, prepared as described above, were infected with the
indicated number of particles/cell of Ad5.βgal.ΔF and virus particles produced.
Virus was then used to infect selected cell lines, including 211B, MRC-5 human
fibroblasts, A-10 rat aortic endothelial cells, and THP-1 human monocytic cells.
Unbound virus was removed by washing the cells and the cells were further
incubated at 37 °C for 48 hours. Cells were then fixed with glutaraldehyde and
stained with X-gal. The percentage of stained cells was then determined by light
microscopy; all experiments were done in triplicate.

The results indicated that adenoviral vectors could be retargeted by
pseudotyping using packaging cell lines expressing different fiber proteins.
Particles containing either fiber were equally infectious on 211B cells, while
MRC-5 fibroblasts and THP-1 cells were more readily infected by virus containing
the chimeric fiber. The A-10 rat endothelial cells were more readily infected by
particles containing the wildtype Ad5 fiber protein.

Transient Transcomplementation

The ability of adenovirus type 5 (Ad5) to deliver therapeutic genes to cells
is mediated by the interaction of the adenoviral fiber protein with the
coxsackievirus-adenoviral receptor (CAR). Because a wide range of cells express
CAR, it was thought that it would be difficult to use adenoviruses to deliver
genes to specific cell types. A system for testing modified fiber genes to
identify tropisms of interest is described in copending U.S. application Serial
No. 09/482,682 (also filed as International PCT application No.
PCT/US00/00265 on January 14, 2000). An in vitro system has been
developed that involves infection of tissue culture cells with a fiber-deleted Ad
and transient co-transfection with a plasmid directing fiber expression. This
system allows one to produce and evaluate modified fibers expressed on a viral
particle. This system can be used to produce therapeutic quantities of
adenoviral vectors with modified fiber proteins, with such fibers having a new
tropism added by insertion of a desired ligand into the fiber gene. These fibers
may also have the natural tropism (i.e., binding to CAR) ablated.

Plasmids pDV60 and pDV55 were prepared as described
herein and in U.S. application Serial No. 09/482,682 (also filed as International
PCT application No. PCT/US00/00265 on January 14, 2000). pDV60 is a
pcDNA3.1-based expression plasmid that contains the CMV promoter, Ad5
tripartite leader, an intron, and the Ad5 fiber gene sequence. pDV55 contains
no fiber gene and serves as the negative control. Ad5.βgal.ΔF and 211B are
described above. 293T cells are identical to 293 cells except they express an
integrated SV40 large T antigen gene. HDF cells are human diploid fibroblasts.
293T cells express CAR and αv integrins; HDF cells express αv integrins but no
CAR. Transfections with fiber expression plasmids were performed with
Lipofectamine (GIBCO-BRL) using 20 μg DNA and 50 μl Lipofectamine per 15 cm
dish. Cells were maintained in DMEM supplemented with 10% fetal bovine
serum.

The fiber deletion mutation of Ad5.βgal.ΔF is complemented in trans by
passaging virions through 211B, a cell line that stably expresses functional Ad5
fiber. The present system was designed to complement Ad5.βgal.ΔF by
modified fibers expressed from transfected episomal plasmids in 293T cells. The
result is a simplified and rapid method to incorporate modified fibers on a viral
particle containing the Ad5.βgal.ΔF genome that does not require propagation of
the virus.

The feasibility of transcomplementation of Ad5.βgal.ΔF with episomal
fiber-expressing plasmids was demonstrated in the following experiment. 293T
cells were transfected with one of two plasmids: pDV55, which expresses no
fiber, or pDV60, which expresses wildtype Ad5 fiber. Fiber expression persists
for at least six days. Twenty-four hours after transfection, these cells were
infected at 2000 particles/cell with Ad5.βgal.ΔF passaged through 211B cells.
Seventy-two hours later, a crude viral lysate (CVL) was generated by exposing
the cells to five freeze-thaw cycles. Viral particles were purified by cesium
chloride gradient centrifugation. The resulting virions incorporated the fiber
expressed from the episomal plasmid, as confirmed by Western blots performed
with an antibody specific to the Ad5 fiber.

The episomal plasmid transcomplementation system is suitable for quickly
expressing and evaluating the properties of modified fibers in the context of a
viral particle. Episomal plasmid transcomplementation will also be of great utility
for quickly evaluating a bank of modified fibers for other binding properties,
including new tropisms and the ablation of the native tropism. In addition to the
rapid generation and testing of large numbers of modified fibers, there are other
advantages to the Ad5.βgal.ΔF transcomplementation system in terms of
production and safety. Episomal plasmid transcomplementation has the inherent
advantage over transcomplementation in stable cell lines in that it is not
necessary to make a stable cell line for every modified fiber for complementation
with Ad5.βgal.ΔF. Because Ad5.βgal.ΔF is deleted in E1, E3 and fiber, there is an
additional gene deletion, which should render it very suitable for gene therapy.
In addition, the presence of the fiber gene deletion decreases the opportunity to
generate replication-competent virus via recombination in the packaging cells.
A single Ad vector preparation can be retargeted to any number of different cell
types simply by transfecting the cells with the appropriate fiber-expression
construct.

Preparation of Adenoviral Gene Delivery Vectors Containing the Ad37 fiber
protein

This example describes construction of packaging cell lines expressing
the Ad37 fiber protein, and their use in generating particles of a fiber-deleted Ad
vector (such as Ad5.βgal.ΔF) containing this fiber protein. The fiber protein is
attached to the viral capsid by binding to the penton base protein through its
N-terminus, and the Ad37 fiber was modified in order to make its N-terminal
sequence more closely match that of the Ad5 protein to ensure that it would
efficiently bind the Ad5 penton base in these vectors.

A. Materials and methods 

Cell lines and wild-type adenovirus. Human A549 lung carcinoma
epithelial cells and human Chang C conjunctival cells (American Type Culture
Collection) were maintained in complete Dulbecco's Modified Eagle Medium
(DMEM) with 10% fetal bovine serum. Wild-type Ad19p and Ad37 (ATCC) were
propagated in A549 cells and purified by banding on CsCl density gradients as
previously described (Huang et al. (1999) J. Virol. 73:2798-2802). Viral protein
concentration was determined by the Bio-Rad Protein Assay, and was used to
calculate the number of viral particles based on the known molecular weight of
Ad2 virions (1 μg = 4 x 10^9 particles).
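The conversion from measured viral protein to particle number is a single multiplication, sketched below in Python together with the reverse calculation for a planned infection. The conversion factor is the one stated above; the function names and the example cell number are hypothetical and chosen only for illustration.

```python
PARTICLES_PER_UG = 4e9  # conversion stated in the text: 1 ug protein = 4 x 10^9 particles


def particles_from_protein(protein_ug: float) -> float:
    """Number of viral particles corresponding to a mass of viral protein (ug)."""
    return protein_ug * PARTICLES_PER_UG


def protein_needed_ug(cell_count: float, particles_per_cell: float) -> float:
    """Mass of viral protein (ug) required to infect cells at a given particles/cell."""
    return cell_count * particles_per_cell / PARTICLES_PER_UG


# Illustrative values: 2.5 ug of purified virus, and a hypothetical dish of
# 2 x 10^6 cells infected at 2000 particles/cell.
n_particles = particles_from_protein(2.5)
dose_ug = protein_needed_ug(2e6, 2000)
```

This estimate counts physical particles from protein mass and says nothing about the fraction of particles that are infectious.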

B. Construction of the Ad37 fiber expressing cell lines and the recombinant 
Ad37 knob protein. 

1. Construction of an Expression Plasmid for the Ad37 Fiber Protein
(pDV80)

The plasmid designated pDV80 (see SEQ ID No. 42), prepared for
expression of the Ad37 fiber protein in mammalian cells, uses the same
regulatory elements as the elements in pDV60, pDV67, and pDV69 to express
the Ad37 fiber in packaging lines. It was constructed in two steps.

First, the Ad37 fiber open reading frame was amplified from Ad37
genomic DNA using synthetic oligonucleotide primers, L37: 5' TGT CCT GGA
TCC AAG ATG AAG CGC GCC CGC CCC AGC GAA GAT GAC TTC 3' (SEQ ID
NO. 43) and 37FR: 5' AAA CAC GGC GGC CGC TCT TTC ATT CTT G 3' (SEQ
ID NO. 44). L37 contains nucleotides (underlined) that differ from the Ad37
genomic sequence in order to add a unique BamHI site (bold) before the start
codon (italicized) and to create point mutations that make the N-terminal
sequence of the fiber more closely match the N-terminal sequence of the Ad5
fiber protein as follows:

Ad37 MSKRLRVEDDFNPVYPY (SEQ ID No. 45)
     KRARPS (SEQ ID No. 46)
Ad5  MKRARPSEDTFNPVYPY (SEQ ID No. 47).

37FR also incorporates a unique NotI site (bold). The PCR product was inserted
into the BamHI and NotI sites of pcDNA3.1/Zeo(+) (Invitrogen) to create
pDV78. The correct sequence of the Ad37 fiber protein, including inserted
changes, was confirmed by sequencing.
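The effect of the L37 primer on the fiber N-terminus can be seen by translating the primer from its start codon, as in the following Python sketch. The primer sequence is taken from the text with the spaces removed; the codon table is the standard genetic code, and the function name translate_from_start is hypothetical.

```python
# Standard genetic code, built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip((a + b + c for a in BASES for b in BASES for c in BASES),
                       AMINO_ACIDS))

# L37 primer (SEQ ID NO. 43) as printed in the text.
L37 = "TGTCCTGGATCCAAGATGAAGCGCGCCCGCCCCAGCGAAGATGACTTC"


def translate_from_start(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the sequence."""
    start = dna.find("ATG")
    if start < 0:
        raise ValueError("no start codon found")
    protein = []
    for i in range(start, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


bamhi_index = L37.find("GGATCC")          # 0-based position of the BamHI site
encoded_n_terminus = translate_from_start(L37)
```

The primer encodes MKRARPSEDDF, that is, the Ad5-like KRARPS substitution followed by the native Ad37 residues EDDF, in agreement with the alignment above.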

Two point mutations in the fiber gene in the 705 line, S356 to P356 and
I362 to T362, were discovered by the sequencing. The mutations are not in the
receptor binding domain of the Ad37 fiber gene in the 705 cell line; they are
buried in the knob trimer interface. To confirm that these mutations do not
affect receptor binding, the Ad37 fiber protein with the correct sequence was
recloned, and 293T cells were transfected with the resulting construct and
subsequently infected with Ad5.GFP.ΔF to produce Ad37 pseudotyped virus. The
results were the same as the results of the experiments with Ad37 pseudotyped
virus produced from line 705 (see Wu et al. (2001) Virology 273:78-89).

Second, a 1.2 kb BamHI/BglII fragment containing an adenovirus type 5
tripartite leader was excised from pDV55 (see EXAMPLE 3) and inserted into the
BamHI site of pDV78 to create pDV80 (SEQ ID No. 42). Plasmid pDV80 has
been deposited in the ATCC under accession number PTA-1147.

2. Construction of the recombinant Ad37 knob protein 
Recombinant Ad37 knob protein containing an N-terminal T7•Tag was 
produced in E. coli using the pET expression system (Novagen). Ad37 fiber DNA 
(GenBank accession number U69132) was PCR amplified from wild-type Ad37 
genomic DNA using the following primers (SEQ ID Nos. 48 and 49): 
5' GGATCCATGGGATACTTGGTAGCA 3' (BamHI site underlined) and 
5' GCAACTCGAGTCATTCTTGGGCAATATAGG 3' (XhoI site underlined). 
The PCR reactions were performed at 94 °C (denaturation), 55 °C 
(annealing), 72 °C (extension, 30 cycles) using Taq DNA polymerase (Qiagen). 
The amplified DNA fragments, which contained residues 172 to 365 of the Ad37 
fiber protein with the addition of an N-terminal start codon (italicized), were 
purified and subcloned into the pCR-TOPO vector using the TA-Cloning Kit 
(Invitrogen). No replication errors were found by DNA sequencing. Plasmids 
from cultured transformed colonies were purified and digested with BamHI and 
XhoI. The fragment was inserted into the BamHI and XhoI sites of the bacterial 
expression vector, pET21a (Novagen), and transformed into (DE3)pLysS 
expression cells (Invitrogen). Colonies were screened for knob expression by 
induction with 1 mM IPTG for four hours at 37 °C, and knob expression was 
determined by SDS-PAGE. The colony displaying the highest knob expression was 
used for large-scale knob expression and induced with 0.5 mM IPTG at 30 °C for 
four hours. 

The recombinant T7•Tagged Ad37 knob protein was purified from 
sonicated bacteria using the T7•Tag Affinity Purification Kit as recommended by 
the manufacturer (Novagen). Recovered protein was analyzed for purity by 
SDS-PAGE followed by Coomassie staining or Western blotting with an 
HRP-conjugated α-T7•Tag monoclonal antibody as described by the manufacturer 
(Novagen) or an α-Ad37 fiber rabbit antibody. 

3. Preparation of Cell Lines that Express the Ad37 fiber protein 

Plasmid pDV80 DNA was purified using the Qiagen method and 
electroporated into the adenovirus-complementing cell line E1-2a S8 (see 
Examples herein; see also, Gorziglia et al. (1996) J. Virology 70:4173-4178; 
and Von Seggern et al. (1998) J. Gen. Virol. 79:1461-1468). Stable clones were 
selected with 600 μg/ml zeocin (Invitrogen). 

Clones were expanded and screened for fiber expression by indirect 
immunofluorescence (Von Seggern et al. (1998) J. Gen. Virol. 79:1461-1468) 
using a rabbit polyclonal antibody directed against the Ad37 fiber (α-Ad37 fiber 
rabbit antibody) raised by immunizing rabbits with recombinant Ad37 fiber 
protein. Two clones (lines 705 and 731) that expressed the protein at a 
uniformly high level were selected. 
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EXAMPLE 9 

Production of Pseudotyped Ad Vector Particles 

To generate vector particles equipped ('pseudotyped') with the Ad37 
fiber protein, the Ad37 fiber-expressing 705 cells were infected (approximately 
1000 particles/cell) with Ad5.βgal.AF or with Ad5.GFP.AF. 

Materials and methods 

Ad5.βgal.AF 

The construction of Ad5.βgal.AF is described in Example 2 (it was 
deposited on January 15, 1999, with the ATCC as listed above under accession 
number VR2636; see also, Von Seggern et al. (1999) J. Virol. 73:1601-1608; 
copending U.S. application Serial No. 09/482,682 filed January 14, 2000, and 
also International PCT application No. PCT/US00/00265, filed January 14, 
2000). 

Ad5.GFP.AF 

Ad5.GFP.AF was constructed by recombination in bacteria using a 
modification of the AdEasy System (see, U.S. Patent No. 5,922,576; see also, 
He et al. (1998) Proc. Natl. Acad. Sci. U.S.A. 95:2509-2514; the system is 
publicly available from the authors and other sources). 

First, a fiber-deleted genomic plasmid was constructed by removing the 
fiber gene from pAdEasy-1 (see, U.S. Patent No. 5,922,576; and He et al. 
(1998) Proc. Natl. Acad. Sci. U.S.A. 95:2509-2514; the AdEasy system and 
vectors are publicly available from He et al. at Johns Hopkins University). 
Plasmid pAdEasy-1 contains the entire Ad5 genome, except for nucleotides 
1-3,533, which encompass the E1 genes, and nucleotides 28,130-30,820, which 
encompass the E3 gene. 

Plasmid pDV43 (see Example 2; see also, Von Seggern et al. (1999) 
J. Virol. 73:1601-1608) was digested with PacI, the ends blunted by treatment 
with the large fragment of E. coli DNA polymerase and dNTPs, and the product 
re-ligated to produce plasmid pDV76. The resulting plasmid pDV76 is identical to 
pDV43 except for loss of the PacI site and contains the right end of the Ad5 
genome with E3 and fiber deletions. A 4.23 kb fragment from pDV76 was 
amplified using the oligonucleotide primers (SEQ ID Nos. 50 and 51): 
5' CGC GCT GAC TCT TAA GGA CTA GTT TC 3', including the unique SpeI 
site in the Ad5 genome (bold); and 5' GCG CTT AAT TAA CAT CAT CAA TAA 
TAT ACC TTA TTT T 3', including a new PacI site (bold) adjacent to the right 
Ad5 ITR. Hence the resulting PCR amplified fragment contains nucleotides 
27,082 to 35,935 of the Ad5 genome with deletions of nucleotides 28,133 to 
32,743 (the E3 and fiber genes), and was used to replace the corresponding 
SpeI/PacI fragment of pAdEasy-1 (see, U.S. Patent No. 5,922,576) to create 
pDV77. 
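
As a rough consistency check on the fragment size quoted above, the retained length can be computed from the genome coordinates. This is a sketch under stated assumptions: the coordinates are treated as inclusive, the non-templated primer tails are ignored, and the helper name `span_len` is illustrative.

```python
def span_len(first, last):
    """Length of an inclusive nucleotide interval first..last."""
    return last - first + 1


# Ad5 coordinates quoted in the text.
amplified = span_len(27082, 35935)   # region spanned by the two primers
deleted = span_len(28133, 32743)     # E3 and fiber deletion
retained = amplified - deleted       # nucleotides actually present

print(amplified, deleted, retained)
```

With inclusive coordinates the retained sequence is 4,243 nucleotides, which agrees with the fragment size of about 4.23 kb stated for the PCR product. 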

Second, E. coli strain BJ5183 was electroporated with a mixture of 
pDV77 and PmeI-linearized pAdTrack as described (U.S. Patent No. 5,922,576; 
He et al. (1998) Proc. Natl. Acad. Sci. U.S.A. 95:2509-2514), and DNA was 
isolated from kanamycin-resistant colonies. The resulting plasmid, pDV83, 
contains a complete Ad5 genome with E1-, E3-, and fiber-deletions with a 
CMV-driven GFP reporter gene inserted at the site of the E1 deletion. The full 
length Ad chromosome was isolated by PacI digestion, and transfected into the 
E1- and fiber-complementing 633 cells (Von Seggern et al. (2000) J. Virol. 
74:354-362). The 633 cells were produced by electroporating pDV67 (SEQ ID 
No. 30, deposited under ATCC accession number PTA-1145) into the E1-2a S8 
cells, described above. The recovered virus Ad5.GFP.AF was then plaque 
purified by plating on 633 cells and virus stocks were prepared by 
freeze-thawing cell pellets. 

Ad5-pseudotype particle production 

Particles with Ad5 fiber 

Ad5-pseudotyped particles were generated by virus growth in 633 cells, 
which express the wild type Ad5 fiber protein. Viral particles were isolated and 
purified over CsCl gradients (Von Seggern et al. (1999) J. Virol. 73:1601-1608; 
purified by centrifugation on preformed 15-40% CsCl gradients (111,000 x g for 
three hours at 4 °C)). For analysis of viral proteins, ten μg of the purified 
particles were electrophoresed on 8-16% gradient gels and the protein 
transferred to nylon membranes. The resulting blot was probed with rabbit 
polyclonal antibodies raised against recombinant Ad37 fiber or Ad5 fiber or 
penton base proteins expressed in baculovirus-infected cells. 
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Particles with Ad37 fiber 

Cells from the Ad37 fiber producing cell line 705 were infected at 
approximately 1000 particles/cell with Ad5.βgal.AF or with Ad5.GFP.AF. Viral 
particles were isolated and purified over CsCl gradients. The bands were 
harvested, dialyzed into storage buffer (10 mM Tris, pH 8.1, 0.9% NaCl, and 
10% glycerol), aliquoted and stored at -70 °C. 

Viral protein analyses 

For analysis of viral proteins, 10 μg of purified Ad5.βgal.AF particles with 
no fiber (grown in 293 cells), the Ad5 fiber (grown in 633 cells), or the Ad37 
fiber (grown in 705 cells) were electrophoresed by 8-16% polyacrylamide 
gradient SDS-PAGE, and the proteins were transferred to nylon membranes. The 
blot was then probed with α-Ad37 fiber rabbit antibody. Ad5 fiber and penton 
base were detected by reprobing the blot with polyclonal antibodies raised 
against recombinant proteins expressed in baculovirus-infected cells (Wickham 
et al. (1993) Cell 73:309-319). 

Adenovirus infection and cell binding assays 

Adherent Chang C and A549 cells were infected with GFP expressing 
Ad5 vectors containing the Ad5 fiber (Ad5.GFP.AF/5F) or the Ad37 fiber 
(Ad5.GFP.AF/37F) at 10,000 particles per cell for 3 hours at 37 °C, 5% CO2 in 
DMEM, 10% FCS. Cells were washed twice with saline and then cultured 
overnight at 37 °C, 5% CO2. The next day, the cells were detached with buffer 
containing 0.05% (w/v) trypsin and 0.5 mM EDTA (Boehringer Mannheim) for 5 
minutes at 37 °C. Suspended cells were washed once with phosphate-buffered 
saline (PBS) and then resuspended in PBS, pH 7.4. GFP fluorescence was 
measured with a FACScan flow cytometer. A threshold established by the 
fluorescence of uninfected cells was used to distinguish cells expressing GFP. 
To assess the role of CAR in Ad infection, 10,000 attached cells were 
pre-incubated with 180 μg/ml RmcB, a function-blocking anti-CAR monoclonal 
antibody (Hsu et al. (1988) J. Virol. 62:1647-1652), in complete DMEM for 1 
hour at 4 °C. A small volume containing Ad5.GFP.AF/5F or Ad5.GFP.AF/37F 
was then added at 10,000 particles per cell. The cells were infected for 3 
hours, cultured overnight, harvested, and analyzed for GFP expression. The 
percentage of cells expressing GFP was determined as the percentage of cells 
detected above a threshold set by the fluorescence of uninfected Chang C cells. 
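
The threshold-based scoring of GFP-positive cells described above can be illustrated with a short sketch. The text does not state how the threshold was derived from the uninfected population, so the nearest-rank percentile used here, the function names, and the sample values are all assumptions made for illustration only.

```python
def gfp_threshold(uninfected, percentile=99):
    """Fluorescence value at the given percentile (nearest-rank method)
    of the uninfected control population. The percentile is an assumed
    choice; the text only says the threshold was set by uninfected cells."""
    ordered = sorted(uninfected)
    # Integer ceiling of len * percentile / 100, at least rank 1.
    rank = max(1, -(-len(ordered) * percentile // 100))
    return ordered[rank - 1]


def percent_gfp_positive(sample, threshold):
    """Percentage of events whose fluorescence exceeds the threshold."""
    positive = sum(1 for value in sample if value > threshold)
    return 100.0 * positive / len(sample)


# Invented fluorescence values (arbitrary units), for illustration only.
uninfected = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
infected = [5, 20, 30, 8, 50]

threshold = gfp_threshold(uninfected, percentile=90)
print(threshold, percent_gfp_positive(infected, threshold))
```

With these invented values the threshold is 9 and three of the five infected-sample events exceed it, giving 60% GFP-positive cells. 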

To measure adenovirus binding to cells, wild type Ad37 was labeled with 
125I using Iodogen (Pierce) according to manufacturer instructions and separated 
from free 125I by gel filtration as described (Huang et al. (1999) J. Virol. 
73:2798-2802). Binding of radiolabeled wild type Ad37 on Chang C cells was 
then quantitated as described (Huang et al. (1999) J. Virol. 73:2798-2802). 
Non-specific binding was determined by incubating cells and labeled Ad37 
particles in the presence of a 100-fold excess of unlabeled Ad37. Specific 
binding was calculated by subtracting the non-specific binding from the total 
cpm bound. To examine if divalent cations are required for binding, 10 mM 
ethylenediaminetetraacetic acid (EDTA) or various concentrations of CaCl2 or 
MgCl2 were added to cells before incubation with labeled virus. To examine if 
the receptor for Ad37 is a protein, cells were pretreated with 10 μg/ml trypsin 
(GIBCO), subtilisin (Sigma), proteinase K (Boehringer-Mannheim), or bromelain 
(Sigma) at 37 °C for 1 hour, then washed twice with complete DMEM before 
adding labeled virus. Cells were >95% viable after protease treatment. 

Ad37 binding to conjunctival cells is calcium-dependent. Specific 
125I-labeled Ad37 binding to Chang C cells was measured in the presence of 
10 mM EDTA and in the presence of varying concentrations of calcium chloride 
or magnesium chloride. Specific binding was determined by subtracting the 
nonspecific counts in the presence of 100-fold excess unlabeled virus from the 
total counts. 

Pretreatment of conjunctival cells with proteases inhibits Ad37 binding. 
Chang C cells were pretreated with various proteases for 1 hour before binding 
125I-labeled Ad37 to the cells. Nonspecific binding was measured by adding 
100-fold unlabeled Ad37 to cells with 125I-labeled Ad37 and subtracting from 
total counts for specific binding. Percent inhibition represents the difference in 
specific binding of untreated cells and pretreated cells as a percentage of the 
specific binding of untreated cells. 
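
The two calculations defined above, specific binding and percent inhibition, can be written out as follows. This is a minimal sketch: the function names are illustrative and the cpm values are invented for the example, not measured data.

```python
def specific_binding(total_cpm, nonspecific_cpm):
    """Specific binding = total counts minus counts bound in the
    presence of 100-fold excess unlabeled virus."""
    return total_cpm - nonspecific_cpm


def percent_inhibition(specific_untreated, specific_treated):
    """Difference in specific binding between untreated and pretreated
    cells, as a percentage of the untreated value."""
    return 100.0 * (specific_untreated - specific_treated) / specific_untreated


# Invented counts (cpm), for illustration only.
untreated = specific_binding(total_cpm=12000, nonspecific_cpm=2000)
treated = specific_binding(total_cpm=6000, nonspecific_cpm=2000)

print(untreated, treated, percent_inhibition(untreated, treated))
```

For these invented counts the specific binding falls from 10,000 to 4,000 cpm, which corresponds to 60% inhibition. 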
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Virus overlay protein blot assay (VOPBA) 

For VOPBA of human conjunctival membrane proteins probed with Ad37 
in the presence of EDTA or calcium chloride, Chang C membrane fractions were 
separated by 8% SDS-PAGE and transferred to a PVDF membrane. The 
membrane was subsequently probed with or without whole Ad37 particles, a 
polyclonal antibody against Ad37 fiber, and finally a horseradish peroxidase 
conjugated anti-rabbit antibody, in the presence of EDTA or calcium chloride. 
Transferred Chang C membrane proteins were also probed with recombinant Ad37 
knob protein, instead of whole Ad37 particles, in the presence of calcium 
chloride. 

Confluent monolayers of Chang C and A549 cells were detached by 
scraping, pelleted by centrifugation, and then resuspended in 250 mM sucrose, 
20 mM HEPES, pH 7.0, 1 mM EDTA, and 2 μg/ml aprotinin and leupeptin. Cells 
were transferred into a Dounce homogenizer and disrupted with 30 strokes. 
Organelles and nuclei were pelleted at 500 x g for 15 min. Plasma membrane 
fragments were then pelleted from the supernatant of cell lysates at 200,000 x g 
for 1 hour and then resuspended in 10 mM Tris•Cl, pH 8.1, 10 μg/ml aprotinin 
and leupeptin. 

Cell membranes of Chang C or A549 cells were incubated (1:1) with a 
2% SDS, non-reducing buffer and separated on an 8% polyacrylamide gel 
without boiling. Membrane proteins were then electroblotted onto a PVDF 
membrane (Immobilon-P) and blocked in 5% (w/v) milk in PBS, pH 7.4, 0.02% 
Tween-20 (PBS-T). After blocking, the membrane was incubated with 1 μg/ml 
wild-type Ad19p or Ad37 in 0.5% (w/v) milk in PBS-T, 1 mM CaCl2, for 1 hour 
at room temperature. The membrane was then washed once with 
phosphate-buffered saline, pH 7.4 (PBS), 1 mM CaCl2, and incubated with a 
1:500 dilution of α-Ad37 fiber rabbit antibody in 0.5% (w/v) milk in PBS-T, 
1 mM CaCl2, for 30 minutes at room temperature. The membrane was washed 
again with PBS, 1 mM CaCl2, and incubated with a 1:5000 dilution of 
horseradish peroxidase (HRP) conjugated α-rabbit antibody (Sigma) in 0.5% 
(w/v) milk in PBS-T, 1 mM CaCl2, for 30 minutes at room temperature. The 
membrane was washed four times in PBS, 1 mM CaCl2, once with PBS-T, 1 mM 
CaCl2, and once in 1 mM CaCl2. The blot was developed with enhanced 
chemiluminescence reagents (Pierce) for 5 minutes and placed onto a piece of 
Biomax film (Kodak) for 5 seconds to 1 minute. For divalent metal cation 
experiments, membranes were incubated in the presence of 2 mM EDTA instead 
of 1 mM CaCl2 in all solutions. To assay fiber knob binding to cell membrane 
proteins, membrane filters were incubated with 1 μg/ml purified T7-tagged Ad37 
knob protein in Tris-buffered saline, 0.1% Tween-20, 1 mM CaCl2, for 1 hour at 
room temperature. α-Ad37 fiber rabbit antibody and HRP-conjugated anti-rabbit 
antibody were applied and the membrane was developed with substrate solution 
as described above. 

Results: Comparison of adenovirus infection of human conjunctival and lung 
epithelial cells with virus particles retargeted with Ad5 or Ad37 fiber proteins 

Packaging cell lines producing the Ad37 fiber protein were generated. 
Since the N-terminal amino acid sequences of the Ad5 and Ad37 fiber proteins 
differ significantly, and to ensure that the Ad37 fiber would be efficiently 
incorporated into Ad5 vector particles, several residues in the wild-type Ad37 
fiber were mutated to more closely match the Ad5 sequence. Stable cell lines 
producing this fiber under control of the CMV promoter and the adenovirus type 
5 tripartite leader were then generated and screened for fiber expression by 
indirect immunofluorescence. One clone (line 705), which expressed the Ad37 
fiber at a high level, was selected for further study. 

Cells from cell line 633, which expresses the wild-type Ad5 fiber 
protein, and from line 705 were infected with a fiber-deleted Ad5 vector 
carrying a β-galactosidase reporter gene. The resulting vector particles 
contained the Ad5 fiber protein (Ad5.βgal.AF/5F) and the Ad37 fiber protein 
(Ad5.βgal.AF/37F), respectively. Incorporation of the correct fiber protein into 
viral particles was verified by Western blotting. Adenoviral vectors containing 
the GFP reporter gene, Ad5.GFP.AF/5F and Ad5.GFP.AF/37F, were created in the 
same fashion. 

Infection of a variety of cell types using the retargeted adenovirus 
particles was examined. As assayed by GFP fluorescence, Ad5.GFP.AF/5F 
exhibited good gene delivery to lung epithelial (A549) and conjunctival cells 
(Chang C). In contrast, Ad5.GFP.AF/37F efficiently delivered GFP to Chang C 
cells, but exhibited very poor gene delivery to A549 cells. Although CAR is 
expressed on the surface of A549 cells, as indicated by Ad5.GFP.AF/5F 
infection, Ad5.GFP.AF/37F was unable to infect these cells efficiently. This 
experiment shows that the Ad37 fiber protein can confer preferential infection of 
human conjunctival cells, but not CAR-expressing human lung epithelial cells. 

Hence CAR is not the primary receptor for Ad37. Recent studies reported 
that expression of CAR on the surface of Chinese hamster ovary (CHO) cells did 
not improve Ad37 binding (Arnberg et al. (2000) J. Virol. 74:42-48), implying 
that Ad37 does not use CAR as a primary receptor. In order to verify this on 
human conjunctival cells, A549 and Chang C cells were pretreated with RmcB 
(Hsu et al. (1988) J. Virol. 62:1647-1652), a function-blocking monoclonal 
antibody against CAR. The RmcB antibody inhibited infection of A549 cells by 
Ad5.GFP.AF/5F, but it had little effect on infection of Chang C cells by 
Ad5.GFP.AF/37F. This indicates that CAR is not the primary receptor for Ad37 
on Chang C conjunctival cells. 

Ad37 binding to conjunctival cells requires divalent metal cations. It has 
been proposed (Roelvink et al. (1998) J. Virol. 72:7909-7915) that a combination 
of fiber binding to CAR and penton base binding to αv-integrins allows some 
adenovirus serotypes to attach to cells. Although αv-integrin binding to the RGD 
motif of the adenovirus penton base is of relatively low affinity (Wickham et al. 
(1993) Cell 73:309-319), it may nonetheless contribute to viral attachment to the 
cell surface. Ad37 shows a particularly strong affinity for binding to integrin αvβ5 
(Mathias et al. (1998) J. Virol. 72:8669-8675), suggesting that integrin αvβ5 
might be a primary receptor for Ad37. Binding of the RGD motif by αv-integrins 
requires the presence of divalent cations, such as calcium or magnesium (Stuiver 
et al. (1996) J. Cell Physiol. 168:521-531). In contrast, no divalent cations were 
required for binding in the CAR-Ad12 knob complex (Bewley et al. (1999) 
Science 286:1579-1583). 

To investigate the potential role of αv-integrins and divalent metal cations 
in Ad37 receptor binding, 125I-labeled Ad37 binding to Chang C cells was 
examined in the absence or presence of EDTA. EDTA inhibited Ad37 binding to 
conjunctival cells but did not alter Ad5 binding. These findings suggest a 
requirement for divalent metals for Ad37 binding. 
The presence of either calcium or magnesium ions helps αvβ5 organize in focal 
contacts (Stuiver et al. (1996) J. Cell Physiol. 168:521-531), suggesting that 
calcium and magnesium aid in integrin αvβ5 function. To further test the 
potential role of integrin αvβ5 in Ad37 cell attachment, 125I-labeled Ad37 binding 
to Chang C cells was measured in the presence of varying concentrations of 
calcium or magnesium chloride. Magnesium ions had little effect on Ad37 
binding to Chang C cells. In contrast, calcium ions dramatically enhanced Ad37 
binding to Chang C cells. The optimal concentration of calcium chloride for 
Ad37 binding was 1 mM, while higher concentrations of calcium actually 
decreased virus binding to cells. The fact that calcium, but not magnesium, 
promoted Ad37 attachment is not consistent with integrin αvβ5 as the primary 
receptor for viral attachment to the cells, since either metal will support ligand 
binding to integrin αvβ5. Moreover, A549 cells express abundant αv-integrins 
(Mathias et al. (1998) J. Virol. 72:8669-8675) but were unable to support 
efficient binding of Ad37. 

Wild-type Ad37 particles bind to three conjunctival membrane proteins. 
Recent studies reported that protease treatment of CHO cells abolished Ad37 
binding (Arnberg et al. (2000) J. Virol. 74:42-48), implying that Ad37 bound to 
a protein receptor on CHO cells. Scatchard analysis of Ad37 binding to Chang C 
cells showed that each cell expresses approximately 24,000 fiber binding sites 
(Huang et al. (1999) J. Virol. 73:2798-2802). To determine if the Ad37 binding 
site on human conjunctival cells is also a protein, Chang C cells were treated 
with different proteases prior to measuring binding of 125I-labeled Ad37. 
Digestion of surface proteins by all four proteases inhibited Ad37 binding to 
Chang C cells by greater than 50%. This finding showed that Ad37 also binds 
to a protein receptor on Chang C cells. 

Virus overlay protein blot assays (VOPBAs) were used to identify 
candidate viral protein receptors. This Western blot technique uses intact viral 
particles in place of antibodies to probe virus-receptor interactions. VOPBA was 
used herein to identify Chang C membrane proteins that bind to Ad37. In the 
absence of Ad37 particles, no protein bands were observed, while addition of 
virus in the absence of calcium revealed binding to a single 45 kDa protein. In 
the presence of 1 mM calcium chloride, Ad37 reacted with three proteins with 
approximate molecular weights of 45, 50 and 60 kDa. The same three proteins 
were detected using a recombinant Ad37 fiber knob alone, indicating that 
Ad37-receptor interactions are fiber mediated and do not involve interactions of 
other capsid proteins such as the penton base. The size of the calcium-independent 
protein (45 kDa) is very similar to the known molecular weight of CAR. A direct 
comparison of the Ad37 VOPBA and a CAR Western blot showed that the 45 
kDa receptor co-migrates with CAR on SDS-PAGE. Moreover, two other 
members of subgroup D adenoviruses, Ad9 and Ad15, have been shown to bind 
to CAR (Roelvink et al. (1998) J. Virol. 72:7909-7915). 

Since CAR does not appear to mediate Ad37 binding on intact Chang C 
cells, the possibility that the 50 or 60 kDa protein serves this function was 
tested by examining an adenovirus serotype that does not bind to Chang C cells. 
Ad19p, a closely related subgroup D adenovirus, binds poorly to Chang C cells 
(Huang et al. (1999) J. Virol. 73:2798-2802) and Ad19p recognition of the 
Ad37 receptor is therefore unlikely. Ad19p particles bound to the 45 and 60 
kDa receptors in the VOPBA, but did not bind to the 50 kDa receptor. 
Moreover, the 50 kDa receptor is expressed on Chang C cells, but not A549 
cells, which only support low levels of Ad37 binding and infection. Taken 
together, these data indicate that the 50 kDa protein is a primary candidate 
receptor for Ad37 on human conjunctival cells. 
Discussion 

The identification of the CAR protein as a major adenovirus receptor does 
not explain why certain subgroup D members, such as Ad37, preferentially 
infect ocular cells. A 50 kDa human conjunctival cell membrane protein is 
identified herein as a primary candidate for the receptor for Ad37. This 50 kDa 
protein is not present on A549 lung epithelial cells. Ad37 binding to this receptor 
is calcium-dependent, which is consistent with Ad37 binding and infection 
experiments. Ad37 also bound to a 60 kDa protein that is present on human 
conjunctival and lung epithelial cells. It does not, however, appear to be 
serotype specific. The molecular weights of MHC class I heavy chain, which has 
been proposed as a receptor for Ad5, and αvβ3 and αvβ5 integrins, receptors for 
the penton base, are distinct from the 50 or 60 kDa receptor characterized in this 
study. 

The studies of Ad37-receptor interaction using VOPBAs are consistent 
with previous studies showing that subgroup D adenoviruses can bind to the 
extracellular domain of CAR (Roelvink et al. (1998) J. Virol. 72:7909-7915). 
Biochemical and structural studies on knob-CAR interactions indicate that the 
CAR binding site is located on the AB-loop of the fiber knob. Alignment of the 
fiber sequences of Ad37 and other adenoviruses reveals that the AB-loop of 
Ad37 is similar to those of Ad12 and Ad5. Moreover, a phylogenetic tree of 
adenovirus knobs (Roelvink et al. (1998) J. Virol. 72:7909-7915) shows that 
fiber proteins of subgroup D are similar to those of subgroups C and E, which use 
CAR as their primary receptor. Ad37 does not, however, appear to effectively 
use CAR as a primary receptor, as demonstrated by virus binding and infection 
studies on Chang C conjunctival cells and A549 lung epithelial cells. 

It has been reported that Ad37 uses sialic acid as a receptor on Chinese 
hamster ovary (CHO) cells and human lung carcinoma (A549) cells (Arnberg et 
al. (2000) J. Virol. 74:42-48). Human conjunctival cells were not studied. 
Human corneal epithelial (HCE) cells were the only ocular cell line studied and 
Ad37 binds relatively poorly to these cells, compared to binding on A549 cells 
(Arnberg et al. (2000) J. Virol. 74:42-48). In addition, 8.4 x 10^7 wheat germ 
agglutinin molecules per cell were required to significantly inhibit Ad37 binding 
to sialic acid on sialic acid positive CHO cells (Arnberg et al. (2000) J. Virol. 
74:42-48), three orders of magnitude higher than the number of Ad37 receptors 
on Chang C conjunctival cells (Huang et al. (1999) J. Virol. 73:2798-2802). 
Clearly, sialic acid is not the only factor responsible for Ad37 binding to the cell 
surface and its influence on Ad37 tropism is unclear. 
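
The comparison between the quantity of wheat germ agglutinin and the number of receptors can be verified with simple arithmetic. The sketch below uses only the two figures quoted in this document (8.4 x 10^7 molecules per cell and approximately 24,000 binding sites per cell); the variable names are illustrative.

```python
import math

wga_molecules_per_cell = 8.4e7    # quoted from Arnberg et al. (2000)
ad37_sites_per_cell = 24000       # quoted from Huang et al. (1999)

ratio = wga_molecules_per_cell / ad37_sites_per_cell
orders_of_magnitude = math.log10(ratio)

print(ratio, round(orders_of_magnitude, 2))
```

The ratio is 3,500, that is, between three and four orders of magnitude, consistent with the statement above. 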

The results herein show that Ad37 selects a 50 kDa cellular receptor for 
binding to conjunctival cells, but it is possible that sialic acid also plays a role in 
this interaction. The characterization and identification of the Ad37 receptor 
have therapeutic implications and also explain the different tropism of Ad37. 
The 50 kDa receptor for Ad37 may also be the receptor for other subgroup D 
adenoviruses that cause severe cases of EKC, Ad19a and Ad8. Ad19p is a 
nonpathogenic variant of Ad19 (Arnberg et al. (1998) Virology 227:239-244) 
while Ad19a, along with Ad8 and Ad37, is a major cause of EKC. Ad19a and 
Ad37 have identical fiber proteins (Arnberg et al. (1998) Virology 227:239-244) 
and have similar tropism in vivo. Ad8, Ad19a, and Ad37 agglutinate dog and 
guinea pig erythrocytes more effectively than four other serotypes that are 
associated with less severe forms of conjunctivitis (Arnberg et al. (1998) 
Virology 227:239-244), implying that the receptors of Ad8, Ad19a, and Ad37 
have similar characteristics. Hence, this 50 kDa receptor is an attractive drug 
target against EKC caused by adenoviruses, providing for therapeutic 
intervention in ocular diseases associated with these viruses. 

EXAMPLE 10 
Targeting of the Ad5 vector to photoreceptor cells 

The fiber-deleted adenovirus vector Ad5.GFP.AF was propagated in 705 
cells, which express a modified Ad37 fiber protein. Viral particles 
(Ad5.GFP.AF/37F) were harvested, CsCl-purified and dialyzed into 0.9% NaCl, 10 
mM Tris, pH 8.1, and 10% glycerol. Two to three μl of the resulting solution, 
containing approximately 1 x 10^9 particles/μl, was injected into the vitreous 
chamber of a mouse eye. Seven days post-injection, eyes were harvested, fixed 
with paraformaldehyde and cryo-sectioned. Sections were stained with an 
anti-rhodopsin antibody to identify photoreceptor cells and with DAPI to show 
all cell nuclei. The resulting sections showed red anti-rhodopsin staining in the 
photoreceptors, blue DAPI-stained nuclei, and green GFP staining in any 
transduced cells. The results revealed substantially exclusive transduction of 
photoreceptors. Co-localization of rhodopsin staining and GFP expression 
indicated selective transduction of photoreceptor cells. 

As a control, contralateral eyes were injected with a stock of the 
fiber-deleted vector Ad5.βgal.AF grown in the same Ad37 fiber-expressing cells. 
Since this virus (Ad5.βgal.AF/37F) produces βgal rather than GFP, the green 
staining is absent from the photoreceptors. 

Additional experiments using the Ad37 fiber for targeting to the 
photoreceptor cells have been performed. Subretinal and intravitreal injection 
have been used in mouse models and the results demonstrate targeting to the 
photoreceptors. As with intravitreally injected eyes, the major cell type infected 
via subretinal administration was the photoreceptor. 

As noted, Ad5.GFP.AF/37F infected Chang C cells efficiently, but A549 
cells poorly. Ad37 fiber protein confers preferential infection on human 
conjunctival cells, but not CAR-expressing human lung epithelial cells. Binding 
to conjunctival cells requires divalent cations. 



Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended claims. 
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WHAT IS CLAIMED IS: 

1. An isolated nucleic acid molecule, comprising: 
adenovirus inverted terminal repeat sequences; an adenovirus packaging 
signal operatively linked thereto; and a photoreceptor-specific promoter. 

2. The isolated nucleic acid molecule of claim 1, further comprising a 
nucleic acid encoding a therapeutic product operatively linked to the promoter. 

3. The isolated nucleic acid molecule of claim 1, wherein the 
promoter is a rhodopsin promoter. 

4. The nucleic acid molecule of claim 1, wherein the adenovirus 
genome does not encode a functional fiber protein such that packaging the 
nucleic acid requires complementation in a packaging cell. 

5. A recombinant adenovirus vector, comprising the nucleic acid 
molecule of any of claims 1-4 packaged therein. 

6. A recombinant adenovirus vector of claim 5, wherein inverted 
terminal repeat sequences (ITR) and a packaging signal are derived from 
adenovirus type 2 or adenovirus type 5. 

7. A recombinant adenovirus vector of claim 5, wherein the virus 
comprises a fiber protein. 

8. A recombinant adenovirus vector of claim 7, wherein the fiber 
20 protein selectively binds to photoreceptors in the eye of a mammal. 

9. A recombinant adenovirus vector of claim 7, wherein the fiber is a 
chimera composed of N-terminal sequences from adenovirus type 2 or type 5, 
and a sufficient portion of an adenovirus serotype D fiber for selective binding to 
photoreceptors in the eye of a mammal.. 

25 10. A method for targeted delivery of a gene product to the eye of a 

mammal, comprising: 

administering a recombinant adenovirus virus that comprises heterologous 
DNA encoding the gene product or resulting in expression of the gene product, 
wherein the recombinant virus comprises a fiber protein that specifically or 

30 selectively binds to receptors that are expressed on cells in the eye. 
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1 1 . The method of claim 10, wherein the cells are photoreceptors. 

12. the method of claim 10, wherein administration is effected by 
intraocular delivery. 

13. The method of claim 10, wherein administration is effected by a 
method selected from subretinal injection, intravenous administration, periorbital 
administration, and intravitreai administration. 

* 14. The method of claim 10, wherein the recombinant virus comprises 
a fiber protein from an adenovirus type D serotype. 

15. The method of any of claims 10-14, wherein the fiber protein is an 
adenovirus type 37. 

16. The method of any of claims 10-14, wherein the fiber is a chimeric 
protein containing a sufficient portion of the N-terminus of an adenovirus type 2 
or type 5 fiber protein for interaction with an adenovirus type 2 or type 5 
penton, and a sufficient portion of an adenovirus serotype D knob portion of the 
fiber for selective binding to photoreceptors in the eye of a mammal. 

17. The method of any of claims 10-16, wherein the recombinant virus 
is an adenovirus type D serotype. 

18. The method of any of claims 10-17, wherein the encapsulated 
nucleic acid comprises a photoreceptor-specific promoter operatively linked to a 
nucleic acid comprising the therapeutic product. 

19. The method of claim 18, wherein the therapeutic product is 
selected from the group consisting of a trophic factor, an anti-apoptotic factor, a 
gene encoding a rhodopsin protein, a wild-type Stargardt disease gene (STDG1), 
an anti-cancer agent and a protein that regulates expression of a photoreceptor- 
specific gene product. 

20. The method of any of claims 10-19, wherein delivery is effected 
for treatment of an ocular disease. 

21. The method of claim 20, wherein the disorder is a retinal 
degenerative disease. 

22. The method of claim 20, wherein the disease is retinitis 
pigmentosa, Stargardt's disease, diabetic retinopathies, retinal vascularization, or 
retinoblastoma. 

23. The method of any of claims 10-22, wherein the mammal is a 
human. 

24. The method of any of claims 10-22, wherein the viral nucleic acid 
comprises: 

adenovirus inverted terminal repeat (ITR) sequences; and 
an adenovirus packaging signal operatively linked thereto. 

25. The method of claim 24, wherein the ITRs and packaging signal 
are derived from an adenovirus serotype B or C. 

26. The method of claim 24, wherein the ITRs and packaging signal 
are derived from an adenovirus type 2 or 5. 

27. The method of claim 24, wherein the viral nucleic acid further 
comprises a photoreceptor-specific promoter. 

28. A method of targeted gene therapy, comprising: 
administering a recombinant viral vector that comprises an adenovirus 
type 37 fiber protein or portion thereof, whereby the vector selectively 
transduces photoreceptors and delivers a gene product encoded by the 
recombinant viral vector; wherein the portion is sufficient for selective binding to 
photoreceptors. 

29. The method of claim 28, wherein the vector is administered into 
the eye. 

30. The method of claim 28, wherein the vector is administered to the 
vitreous cavity of the eye. 

31. The method of claim 28, wherein administration is effected by 
subretinal injection, intravenous administration, periorbital administration or 
intravitreal administration. 

32. The method of any of claims 10-31, wherein at least about 10^7 
plaque forming units of virus are administered. 

33. The method of any of claims 10-31, wherein about 1 plaque 
forming unit to about 10^14 plaque forming units of virus are administered. 

SEQUENCE LISTING 



<110> VON SEGGERN, DANIEL 
NEMEROW, GLEN R. 
FRIEDLANDER, MARTIN 

<120> VECTORS FOR OCULAR TRANSDUCTION AND USE THEREFOR FOR GENETIC THERAPY 

<130> 756.1PCT/NOV0205P 

<140> 

<141> 2001-05-01 

<150> 09/562,934 
<151> 2000-05-01 

<160> 51 

<170> PatentIn Ver. 2.1 

<210> 1 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 1 

cggtacacag aattcaggag acacaactcc 



<210> 2 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 2 

gcctggatcc gggaagttac gtaacgtggg aaaac 



<210> 3 
<211> 12 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: linker 

<400> 3 
cgcggatccg cg 



<210> 4 
<211> 8710 
<212> DNA 

<213> Artificial Sequence 



<220> 
<223> Description of Artificial Sequence: plasmid 



<400> 4 

cacctaaatt gtaagcgtta atattttgtt aaaattcgcg ttaaattttt gttaaatcag 60 
ctcatttttt aaccaatagg ccgaaatcgg caaaatccct tataaatcaa aagaatagac 120 
cgagataggg ttgagtgttg ttccagtttg gaacaagagt ccactattaa agaacgtgga 180 
ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat ggcccactac gtgaaccatc 240 
accctaatca agttttttgg ggtcgaggtg ccgtaaagca ctaaatcgga accctaaagg 300 
gagcccccga tttagagctt gacggggaaa gccggcgaac gtggcgagaa aggaagggaa 360 
gaaagcgaaa ggagcgggcg ctagggcgct ggcaagtgta gcggtcacgc tgcgcgtaac 420 
caccacaccc gccgcgctta atgcgccgct acagggcgcg tcccattcgc cattcaggct 480 
gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa 540 
agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg 600 
ttgtaaaacg acggccagtg aattgtaata cgactcacta tagggcgaat tgggtaccgg 660 
gccccccctc gaggtcgacg gtatcgataa gcttgatatc gaattcagga gacacaactc 720 
caagtgcata ctctatgtca ttttcatggg actggtctgg ccacaactac attaatgaaa 780 
tatttgccac atcctcttac actttttcat acattgccca agaataaaga atcgtttgtg 840 
ttatgtttca acgtgtttat ttttcaattg cagaaaattt caagtcattt ttcattcagt 900 
agtatagccc caccaccaca tagcttatac agatcaccgt accttaatca aactcacaga 960 
accctagtat tcaacctgcc acctccctcc caacacacag agtacacagt cctttctccc 1020 
cggctggcct taaaaagcat catatcatgg gtaacagaca tattcttagg tgttatattc 1080 
cacacggttt cctgtcgagc caaacgctca tcagtgatat taataaactc cccgggcagc 1140 
tcacttaagt tcatgtcgct gtccagctgc tgagccacag gctgctgtcc aacttgcggt 1200 
tgcttaacgg gcggcgaagg agaagtccac gcctacatgg gggtagagtc ataatcgtgc 1260 
atcaggatag ggcggtggtg ctgcagcagc gcgcgaataa actgctgccg ccgccgctcc 1320 
gtcctgcagg aatacaacat ggcagtggtc tcctcagcga tgattcgcac cgcccgcagc 1380 
ataaggcgcc ttgtcctccg ggcacagcag cgcaccctga tctcacttaa atcagcacag 1440 
taactgcagc acagcaccac aatattgttc aaaatcccac agtgcaaggc gctgtatcca 1500 
aagctcatgg cgcfggaccac agaacccacg tggccatcat accacaagcg caggtagatt 1560 
aagtggcgac ccctcataaa cacgctggac ataaacatta cctcttttgg catgttgtaa 1620 
ttcaccacct cccggtacca tataaacctc tgattaaaca tggcgccatc caccaccatc 1680 
ctaaaccagc tggccaaaac ctgcccgccg gctatacact gcagggaacc gggactggaa 1740 
caatgacagt ggagagccca ggactcgtaa ccatggatca tcatgctcgt catgatatca 1800 
atgttggcac aacacaggca cacgtgcata cacttcctca ggattacaag ctcctcccgc 1860 
gttagaacca tatcccaggg aacaacccat tcctgaatca gcgtaaatcc cacactgcag 1920 
ggaagacctc gcacgtaact cacgttgtgc attgtcaaag tgttacattc gggcagcagc 1980 
ggatgatcct ccagtatggt agcgcgggtt tctgtctcaa aaggaggtag acgatcccta 2040 
ctgtacggag tgcgccgaga caaccgagat cgtgttggtc gtagtgtcat gccaaatgga 2100 
acgccggacg tagtcatatt tcctgaagca aaaccaggtg cgggcgtgac aaacagatct 2160 
gcgtctccgg tctcgccgct tagatcgctc tgtgtagtag ttgtagtata tccactctct 2220 
caaagcatcc aggcgccccc tggcttcggg ttctatgtaa actccttcat gcgccgctgc 2280 
cctgataaca tccaccaccg cagaataagc cacacccagc caacctacac attcgttctg 2340 
cgagtcacac acgggaggag cgggaagagc tggaagaacc atgttttttt ttttattcca 2400 
aaagattatc caaaacctca aaatgaagat ctattaagtg aacgcgctcc cctccggtgg 2460 
cgtggtcaaa ctctacagcc aaagaacaga taatggcatt tgtaagatgt tgcacaatgg 2520 
cttccaaaag gcaaacggcc ctcacgtcca agtggacgta aaggctaaac ccttcagggt 2580 
gaatctcctc tataaacatt ccagcacctt caaccatgcc caaataattc tcatctcgcc 2640 
accttctcaa tatatctcta agcaaatccc gaatattaag tccggccatt gtaaaaatct 2700 
gctccagagc gccctccacc ttcagcctca agcagcgaat catgattgca aaaattcagg 2760 
ttcctcacag acctgtataa gattcaaaag cggaacatta acaaaaatac cgcgatcccg 2820 
taggtccctt cgcagggcca gctgaacata atcgtgcagg tctgcacgga ccagcgcggc 2880 
cacttccccg ccaggaacct tgacaaaaga acccacactg attatgacac gcatactcgg 2940 
agctatgcta accagcgtag ccccgatgta agctttgttg catgggcggc gatataaaat 3000 
gcaaggtgct gctcaaaaaa tcaggcaaag cctcgcgcaa aaaagaaagc acatcgtagt 3060 
catgctcatg cagataaagg caggtaagct ccggaaccac cacagaaaaa gacaccattt 3120 
ttctctcaaa catgtctgcg ggtttctgca taaacacaaa ataaaataac aaaaaaacat 3180 
ttaaacatta gaagcctgtc ttacaacagg aaaaacaacc cttataagca taagacggac 3240 
tacggccatg ccggcgtgac cgtaaaaaaa ctggtcaccg tgattaaaaa gcaccaccga 3300 
cagctcctcg gtcatgtccg gagtcataat gtaagactcg gtaaacacat caggttgatt 3360 
catcggtcag tgctaaaaag cgaccgaaat agcccggggg aatacatacc cgcaggcgta 3420 
gagacaacat tacagccccc ataggaggta taacaaaatt aataggagag aaaaacacat 3480 
aaacacctga aaaaccctcc tgcctaggca aaatagcacc ctcccgctcc agaacaacat 3540 
acagcgcttc acagcggcag cctaacagtc agccttacca gtaaaaaaga aaacctatta 3600 
aaaaaacacc actcgacacg gcaccagctc aatcagtcac agtgtaaaaa agggccaagt 3660 
gcagagcgag tatatatagg actaaaaaat gacgtaacgg ttaaagtcca caaaaaacac 3720 
ccagaaaacc gcacgcgaac ctacgcccag aaacgaaagc caaaaaaccc acaacttcct 3780 
caaatcgtca cttccgtttt cccacgttac gtaacttccc ggatccgcgg cattcacagt 3840 
tctccgcaag aattgattgg ctccaattct tggagtggtg aatccgttag cgaggtgccg 3900 
ccggcttcca ttcaggtcga ggtggcccgg ctccatgcac cgcgacgcaa cgcggggagg 3960 
cagacaaggt atagggcggc gcctacaatc catgccaacc cgttccatgt gctcgccgag 4020 
gcggcataaa tcgccgtgac gatcagcggt ccagtgatcg aagttaggct ggtaagagcc 4080 
gcgagcgatc cttgaagctg tccctgatgg tcgtcatcta cctgcctgga cagcatggcc 4140 
tgcaacgcgg gcatcccgat gccgccggaa gcgagaagaa tcataatggg gaaggccatc 4200 
cagcctcgcg tcgcgaacgc cagcaagacg tagcccagcg cgtcggccgc catgccctgc 4260 
ttcatccccg tggcpcgttg ctcgcgtttg ctggcggtgt ccccggaaga aatatatttg 4320 
catgtcttta gttctatgat gacacaaacc ccgcccagcg tcttgtcatt ggcgaattcg 4380 
aacacgcaga tgcagtcggg gcggcgcggt cccaggtcca cttcgcatat taaggtgacg 4440 
cgtgtggcct cgaacaccga gcgaccctgc agcgacccgc ttaacagcgt caacagcgtg 4500 
ccgcagatcc cgggcaatga gatatgaaaa agcctgaact caccgcgacg tctgtcgaga 4560 
agtttctgat cgaaaagttc gacagcgtct ccgacctgat gcagctctcg gagggcgaag 4620 
aatctcgtgc tttcagcttc gatgtaggag ggcgtggata tgtcctgcgg gtaaatagct 4680 
gcgccgatgg tttctacaaa gatcgttatg tttatcggca ctttgcatcg gccgcgctcc 4740 
cgattccgga agtgcttgac attggggaat tcagcgagag cctgacctat tgcatctccc 4800 
gccgtgcaca gggtgtcacg ttgcaagacc tgcctgaaac cgaactgccc gctgttctgc 4860 
agccggtcgc ggaggccatg gatgcgatcg ctgcggccga tcttagccag acgagcgggt 4920 
tcggcccatt cggaccgcaa ggaatcggtc aatacactac atggcgtgat ttcatatgcg 4980 
cgattgctga tccccatgtg tatcactggc aaactgtgat ggacgacacc gtcagtgcgt 5040 
ccgtcgcgca ggctctcgat gagctgatgc tttgggccga ggactgcccc gaagtccggc 5100 
acctcgtgca cgcggatttc ggctccaaca atgtcctgac ggacaatggc cgcataacag 5160 
cggtcattga ctggagcgag gcgatgttcg gggattccca atacgaggtc gccaacatct 5220 
tcttctggag gccgtggttg gcttgtatgg agcagcagac gcgctacttc gagcggaggc 5280 
atccggagct tgcaggatcg ccgcggctcc gggcgtatat gctccgcatt ggtcttgacc 5340 
aactctatca gagcttggtt gacggcaatt tcgatgatgc agcttgggcg cagggtcgat 5400 
gcgacgcaat cgtccgatcc ggagccggga ctgtcgggcg tacacaaatc gcccgcagaa 5460 
gcgcggccgt ctggaccgat ggctgtgtag aagtactcgc cgatagtgga aaccgacgcc 5520 
ccagcactcg tccgagggca aaggaatagg ggagatgggg gaggctaact gaaacacgga 5580 
aggagacaat accggaagga acccgcgcta tgacggcaat aaaaagacag aataaaacgc 5640 
acgggtgttg ggtcgtttgt tcataaacgc ggggttcggt cccagggctg gcactctgtc 5700 
gataccccac cgagacccca ttggggccaa tacgcccgcg tttcttcctt ttccccaccc 5760 
caccccccaa gttcgggtga aggcccaggg ctcgcagcca acgtcggggc ggcaggccct 5820 
gccatagcca ctggccccgt gggttaggga cggggtcccc catggggaat ggtttatggt 5880 
tcgtgggggt tattattttg ggcgttgcgt ggggtctggt ccacgactgg actgagcaga 5940 
cagacccatg gtttttggat ggcctgggca tggaccgcat gtactggcgc gacacgaaca 6000 
ccgggcgtct gtggctgcca aacacccccg acccccaaaa accaccgcgc ggatttctgg 6060 
cgcccagtgc cgtcgaccgg tcatggctgc gccccgacac ccgccaacac ccgctgacgc 6120 
gccctgacgg gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg 6180 
gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa cgcgcgaggc agccggatca 6240 
taatcagcca taccacattt gtagaggttt tacttgcttt aaaaaacctc cccacctccc 6300 
cctgaacctg aaacataaaa tgaatgcaat tgttgttgtt aacttgttta ttgcagctta 6360 
taatggttac aaataaagca atagcatcac aaatttcaca aataaagcat ttttttcact 6420 
gcattctagt tgtggtttgt ccaaactcat caatgtatct tatcatgtct ggatccacta 6480 
gttctagagc ggccgccacc gcggtggagc tccagctttt gttcccttta gtgagggtta 6540 
atttcgagct tggcgtaatc atggtcatag ctgtttcctg tgtgaaattg ttatccgctc 6600 
acaattccac acaacatacg agccggaagc ataaagtgta aagcctgggg tgcctaatga 6660 
gtgagctaac tcacattaat tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg 6720 
tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt gcgtattggg 6780 
cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg 6840 
gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga 6900 
aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg 6960 
gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag 7020 
aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc 7080 
gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg 7140 
ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt 7200 
cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc 7260 
ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc 7320 
actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg 7380 
tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca 7440 
gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 7500 
ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat 7560 
cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt 7620 
ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt 7680 
tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc 7740 
agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc 7800 
gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata 7860 
ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg 7920 
gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc 7980 
cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct 8040 
acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa 8100 
cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt 8160 
cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca 8220 
ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac 8280 
tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca 8340 
atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt 8400 
tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc 8460 
actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca 8520 
aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata 8580 
ctcatactct tcctttttca atattattga agcatttatc agggttattg tctcatgagc 8640 
ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc 8700 
cgaaaagtgc 8710 



<210> 5 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 5 

atgggatcca agatgaagcg cgcaagaccg 



<210> 6 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 6 

cataacgcgg ccgcttcttt attcttgggc 

<210> 7 
<211> 7148 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 



<400> 7 
gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 

gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 

tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 

cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 

attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 

atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 

atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 

tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 

actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 

aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 

gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 

ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gcttggtacc 900 

gagctcggat ccaagatgaa gcgcgcaaga ccgtctgaag ataccttcaa ccccgtgtat 960 

ccatatgaca cggaaaccgg tcctccaact gtgccttttc ttactcctcc ctttgtatcc 1020 

cccaatgggt ttcaagagag tccccctggg gtactctctt tgcgcctatc cgaacctcta 1080 

gttacctcca atggcatgct tgcgctcaaa atgggcaacg gcctctctct ggacgaggcc 1140 

ggcaacctta cctcccaaaa tgtaaccact gtgagcccac ctctcaaaaa aaccaagtca 1200 

aacataaacc tggaaatatc tgcacccctc acagttacct cagaagccct aactgtggct 1260 

gccgccgcac ctctaatggt cgcgggcaac acactcacca tgcaatcaca ggccccgcta 1320 

accgtgcacg actccaaact tagcattgcc acccaaggac ccctcacagt gtcagaagga 1380 

aagctagccc tgcaaacatc aggccccctc accaccaccg atagcagtac ccttactatc 1440 

actgcctcac cccctctaac tactgccact ggtagcttgg gcattgactt gaaagagccc 1500 

atttatacac aaaatggaaa actaggacta aagtacgggg ctcctttgca tgtaacagac 1560 

gacctaaaca ctttgaccgt agcaactggt ccaggtgtga ctattaataa tacttccttg 1620 

caaactaaag ttactggagc cttgggtttt gattcacaag gcaatatgca acttaatgta 1680 

gcaggaggac taaggattga ttctcaaaac agacgcctta tacttgatgt tagttatccg 1740 

tttgatgctc aaaaccaact aaatctaaga ctaggacagg gccctctttt tataaactca 1800 

gcccacaact tggatattaa ctacaacaaa ggcctttact tgtttacagc ttcaaacaat 1860 

tccaaaaagc ttgaggttaa cctaagcact gccaaggggt tgatgtttga cgctacagcc 1920 

atagccatta atgcaggaga tgggcttgaa tttggttcac ctaatgcacc aaacacaaat 1980 

cccctcaaaa caaaaattgg ccatggccta gaatttgatt caaacaaggc tatggttcct 2040 

aaactaggaa ctggccttag ttttgacagc acaggtgcca ttacagtagg aaacaaaaat 2100 
aatgataagc taactttgtg gaccacacca gctccatctc ctaactgtag actaaatgca 2160 

gagaaagatg ctaaactcac tttggtctta acaaaatgtg gcagtcaaat acttgctaca 2220 
gtttcagttt tggctgttaa aggcagtttg gctccaatat ctggaacagt tcaaagtgct 2280 

catcttatta taagatttga cgaaaatgga gtgctactaa acaattcctt cctggaccca 2340 

gaatattgga actttagaaa tggagatctt actgaaggca cagcctatac aaacgctgtt 2400 

ggatttatgc ctaacctatc agcttatcca aaatctcacg gtaaaactgc caaaagtaac 2460 

attgtcagtc aagtttactt aaacggagac aaaactaaac ctgtaacact aaccattaca 2520 

ctaaacggta cacaggaaac aggagacaca actccaagtg catactctat gtcattttca 2580 

tgggactggt ctggccacaa ctacattaat gaaatatttg ccacatcctc ttacactttt 2640 

tcatacattg cccaagaata aagaagcggc cgctcgagca tgcatctaga gggccctatt 2700 

ctatagtgtc acctaaatgc tagagctcgc tgatcagcct cgactgtgcc ttctagttgc 2760 

cagccatctg ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc 2820 

actgtccttt cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct 2880 

attctggggg gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg 2940 

catgctgggg atgcggtggg ctctatggct tctgaggcgg aaagaaccag ctggggctct 3000 

agggggtatc cccacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg 3060 

cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct 3120 

tcctttctcg ccacgttcgc cggctttccc cgtcaagctc taaatcgggg catcccttta 3180 

gggttccgat ttagtgcttt acggcacctc gaccccaaaa aacttgatta gggtgatggt 3240 

tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg 3300 

ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat 3360 

tcttttgatt tataagggat tttggggatt tcggcctatt ggttaaaaaa tgagctgatt 3420 

taacaaaaat ttaacgcgaa ttaattctgt ggaatgtgtg tcagttaggg tgtggaaagt 3480 

ccccaggctc cccaggcagg cagaagtatg caaagcatgc atctcaatta gtcagcaacc 3540 

aggtgtggaa agtccccagg ctccccagca ggcagaagta tgcaaagcat gcatctcaat 3600 

tagtcagcaa ccatagtccc gcccctaact ccgcccatcc cgcccctaac tccgcccagt 3660 

tccgcccatt ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc 3720 

gcctctgcct ctgagctatt ccagaagtag tgaggaggct tttttggagg cctaggcttt 3780 

tgcaaaaagc tcccgggagc ttgtatatcc attttcggat ctgatcaaga gacaggatga 3840 

ggatcgtttc gcatgattga acaagatgga ttgcacgcag gttctccggc cgcttgggtg 3900 
gagaggctat tcggctatga ctgggcacaa cagacaatcg gctgctctga tgccgccgtg 3960 
ttccggctgt cagcgcaggg gcgcccggtt ctttttgtca agaccgacct gtccggtgcc 4020 
ctgaatgaac tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct 4080 
tgcgcagctg tgctcgacgt tgtcactgaa gcgggaaggg actggctgct attgggcgaa 4140 
gtgccggggc aggatctcct gtcatctcac cttgctcctg ccgagaaagt atccatcatg 4200 
gctgatgcaa tgcggcggct gcatacgctt gatccggcta cctgcccatt cgaccaccaa 4260 
gcgaaacatc gcatcgagcg agcacgtact cggatggaag ccggtcttgt cgatcaggat 4320 
gatctggacg aagagcatca ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg 4380 
cgcatgcccg acggcgagga tctcgtcgtg acccatggcg atgcctgctt gccgaatatc 4440 
atggtggaaa atggccgctt ttctggattc atcgactgtg gccggctggg tgtggcggac 4500 
cgctatcagg acatagcgtt ggctacccgt gatattgctg aagagcttgg cggcgaatgg 4560 
gctgaccgct tcctcgtgct ttacggtatc gccgctcccg attcgcagcg catcgccttc 4620 
tatcgccttc ttgacgagtt cttctgagcg ggactctggg gttcgaaatg accgaccaag 4680 
cgacgcccaa cctgccatca cgagatttcg attccaccgc cgccttctat gaaaggttgg 4740 
gcttcggaat cgttttccgg gacgccggct ggatgatcct ccagcgcggg gatctcatgc 4800 
tggagttctt cgcccacccc aacttgttta ttgcagctta taatggttac aaataaagca 4860 
atagcatcac aaatttcaca aataaagcat ttttttcact gcattctagt tgtggtttgt 4920 
ccaaactcat caatgtatct tatcatgtct gtataccgtc gacctctagc tagagcttgg 4980 
cgtaatcatg gtcatagctg tttcctgtgt gaaattgtta tccgctcaca attccacaca 5040 
acatacgagc cggaagcata aagtgtaaag cctggggtgc ctaatgagtg agctaactca 5100 
cattaattgc gttgcgctca ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc 5160 
attaatgaat cggccaacgc gcggggagag gcggtttgcg tattgggcgc tcttccgctt 5220 
cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta tcagctcact 5280 
caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag aacatgtgag 5340 
caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg tttttccata 5400 
ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg tggcgaaacc 5460 
cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg cgctctcctg 5520 
ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga agcgtggcgc 5580 
tttctcaatg ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc tccaagctgg 5640 
gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt aactatcgtc 5700 
ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact ggtaacagga 5760 
ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg cctaactacg 5820 
gctacactag aaggacagta tttggtatct gcgctctgct gaagccagtt accttcggaa 5880 
aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt ggtttttttg 5940 
tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct ttgatctttt 6000 
ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg gtcatgagat 6060 
tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt aaatcaatct 6120 
aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt gaggcaccta 6180 
tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc gtgtagataa 6240 
ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg cgagacccac 6300 
gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc gagcgcagaa 6360 
gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg gaagctagag 6420 
taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca ggcatcgtgg 6480 
tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga tcaaggcgag 6540 
ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct ccgatcgttg 6600 
tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg cataattctc 6660 
ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca accaagtcat 6720 
tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata cgggataata 6780 
ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct tcggggcgaa 6840 
aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact cgtgcaccca 6900 
actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa acaggaaggc 6960 
aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc atactcttcc 7020 
tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga tacatatttg 7080 
aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga aaagtgccac 7140 
ctgacgtc 7148 



<210> 8 
<211> 7469 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 8 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gcttggtacc 900 
gagctcggat ctgaattcga gctcgctgtt gggctcgcgg ttgaggacaa actcttcgcg 960 
gtctttccag tactcttgga tcggaaaccc gtcggcctcc gaacggtact ccgccaccga 1020 
gggacctgag cgagtccgca tcgaccggat cggaaaacct ctcgagaaag gcgtctaacc 1080 
agtcacagtc gcaaggtagg ctgagcaccg tggcgggcgg cagcgggtgg cggtcggggt 1140 
tgtttctggc ggaggtgctg ctgatgatgt aattaaagta ggcggtcttg agacggcgga 1200 
tggtcgaggt gaggtgtggc aggcttgaga tccaagatga agcgcgcaag accgtctgaa 1260 
gataccttca accccgtgta tccatatgac acggaaaccg gtcctccaac tgtgcctttt 1320 
cttactcctc cctttgtatc ccccaatggg tttcaagaga gtccccctgg ggtactctct 1380 
ttgcgcctat ccgaacctct agttacctcc aatggcatgc ttgcgctcaa aatgggcaac 1440 
ggcctctctc tggacgaggc cggcaacctt acctcccaaa atgtaaccac tgtgagccca 1500 
cctctcaaaa aaaccaagtc aaacataaac ctggaaatat ctgcacccct cacagttacc 1560 
tcagaagccc taactgtggc tgccgccgca cctctaatgg tcgcgggcaa cacactcacc 1620 
atgcaatcac aggccccgct aaccgtgcac gactccaaac ttagcattgc cacccaagga 1680 
cccctcacag tgtcagaagg aaagctagcc ctgcaaacat caggccccct caccaccacc 1740 
gatagcagta cccttactat cactgcctca ccccctctaa ctactgccac tggtagcttg 1800 
ggcattgact tgaaagagcc catttataca caaaatggaa aactaggact aaagtacggg 1860 
gctcctttgc atgtaacaga cgacctaaac actttgaccg tagcaactgg tccaggtgtg 1920 
actattaata atacttcctt gcaaactaaa gttactggag ccttgggttt tgattcacaa 1980 
ggcaatatgc aacttaatgt agcaggagga ctaaggattg attctcaaaa cagacgcctt 2040 
atacttgatg ttagttatcc gtttgatgct caaaaccaac taaatctaag actaggacag 2100 
ggccctcttt ttataaactc agcccacaac ttggatatta actacaacaa aggcctttac 2160 
ttgtttacag cttcaaacaa ttccaaaaag cttgaggtta acctaagcac tgccaagggg 2220 
ttgatgtttg acgctacagc catagccatt aatgcaggag atgggcttga atttggttca 2280 
cctaatgcac caaacacaaa tcccctcaaa acaaaaattg gccatggcct agaatttgat 2340 
tcaaacaagg ctatggttcc taaactagga actggcctta gttttgacag cacaggtgcc 2400 
attacagtag gaaacaaaaa taatgataag ctaactttgt ggaccacacc agctccatct 2460 
cctaactgta gactaaatgc agagaaagat gctaaactca ctttggtctt aacaaaatgt 2520 
ggcagtcaaa tacttgctac agtttcagtt ttggctgtta aaggcagttt ggctccaata 2580 
tctggaacag ttcaaagtgc tcatcttatt ataagatttg acgaaaatgg agtgctacta 2640 
aacaattcct tcctggaccc agaatattgg aactttagaa atggagatct tactgaaggc 2700 
acagcctata caaacgctgt tggatttatg cctaacctat cagcttatcc aaaatctcac 2760 
ggtaaaactg ccaaaagtaa cattgtcagt caagtttact taaacggaga caaaactaaa 2820 
cctgtaacac taaccattac actaaacggt acacaggaaa caggagacac aactccaagt 2880 
gcatactcta tgtcattttc atgggactgg tctggccaca actacattaa tgaaatattt 2940 
gccacatcct cttacacttt ttcatacatt gcccaagaat aaagaagcgg ccgctcgagc 3000 
atgcatctag agggccctat tctatagtgt cacctaaatg ctagagctcg ctgatcagcc 3060 
tcgactgtgc cttctagttg ccagccatct gttgtttgcc cctcccccgt gccttccttg 3120 
accctggaag gtgccactcc cactgtcctt tcctaataaa atgaggaaat tgcatcgcat 3180 
tgtctgagta ggtgtcattc tattctgggg ggtggggtgg ggcaggacag caagggggag 3240 
gattgggaag acaatagcag gcatgctggg gatgcggtgg gctctatggc ttctgaggcg 3300 
gaaagaacca gctggggctc tagggggtat ccccacgcgc cctgtagcgg cgcattaagc 3360 
gcggcgggtg tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc 3420 
gctcctttcg ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct 3480 
ctaaatcggg gcatcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa 3540 
aaacttgatt agggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc 3600 
cctttgacgt tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca 3660 
ctcaacccta tctcggtcta ttcttttgat ttataaggga ttttggggat ttcggcctat 3720 
tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga attaattctg tggaatgtgt 3780 
gtcagttagg gtgtggaaag tccccaggct ccccaggcag gcagaagtat gcaaagcatg 3840 
catctcaatt agtcagcaac caggtgtgga aagtccccag gctccccagc aggcagaagt 3900 
atgcaaagca tgcatctcaa ttagtcagca accatagtcc cgcccctaac tccgcccatc 3960 
ccgcccctaa ctccgcccag ttccgcccat tctccgcccc atggctgact aatttttttt 4020 
atttatgcag aggccgaggc cgcctctgcc tctgagctat tccagaagta gtgaggaggc 4080 
ttttttggag gcctaggctt ttgcaaaaag ctcccgggag cttgtatatc cattttcgga 4140 
tctgatcaag agacaggatg aggatcgttt cgcatgattg aacaagatgg attgcacgca 4200 
ggttctccgg ccgcttgggt ggagaggcta ttcggctatg actgggcaca acagacaatc 4260 
ggctgctctg atgccgccgt gttccggctg tcagcgcagg ggcgcccggt tctttttgtc 4320 
aagaccgacc tgtccggtgc cctgaatgaa ctgcaggacg aggcagcgcg gctatcgtgg 4380 
ctggccacga cgggcgttcc ttgcgcagct gtgctcgacg ttgtcactga agcgggaagg 4440 
gactggctgc tattgggcga agtgccgggg caggatctcc tgtcatctca ccttgctcct 4500 
gccgagaaag tatccatcat ggctgatgca atgcggcggc tgcatacgct tgatccggct 4560 
acctgcccat tcgaccacca agcgaaacat cgcatcgagc gagcacgtac tcggatggaa 4620 
gccggtcttg tcgatcagga tgatctggac gaagagcatc aggggctcgc gccagccgaa 4680 
ctgttcgcca ggctcaaggc gcgcatgccc gacggcgagg atctcgtcgt gacccatggc 4740 
gatgcctgct tgccgaatat catggtggaa aatggccgct tttctggatt catcgactgt 4800 
ggccggctgg gtgtggcgga ccgctatcag gacatagcgt tggctacccg tgatattgct 4860 
gaagagcttg gcggcgaatg ggctgaccgc ttcctcgtgc tttacggtat cgccgctccc 4920 
gattcgcagc gcatcgcctt ctatcgcctt cttgacgagt tcttctgagc gggactctgg 4980 
ggttcgaaat gaccgaccaa gcgacgccca acctgccatc acgagatttc gattccaccg 5040 
ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg ggacgccggc tggatgatcc 5100 
tccagcgcgg ggatctcatg ctggagttct tcgcccaccc caacttgttt attgcagctt 5160 
ataatggtta caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac 5220 
tgcattctag ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc tgtataccgt 5280 
cgacctctag ctagagcttg gcgtaatcat ggtcatagct gtttcctgtg tgaaattgtt 5340 
atccgctcac aattccacac aacatacgag ccggaagcat aaagtgtaaa gcctggggtg 5400 
cctaatgagt gagctaactc acattaattg cgttgcgctc actgcccgct ttccagtcgg 5460 
gaaacctgtc gtgccagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc 5520 
gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 5580 
ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 5640 
acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 5700 
cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 5760 
caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 5820 
gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 5880 
tcccttcggg aagcgtggcg ctttctcaat gctcacgctg taggtatctc agttcggtgt 5940 
aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 6000 
ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg 6060 
cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct 6120 
tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc 6180 
tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 6240 
ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 6300 
aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 6360 
aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa 6420 
aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat 6480 
gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct 6540 
gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg 6600 
caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag 6660 
ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta 6720 
attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg 6780 
ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg 6840 
gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct 6900 
ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta 6960 
tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg 7020 
gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc 7080
cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg 7140
gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga 7200
tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg 7260
ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat 7320 
gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc 7380 
tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca 7440 
catttccccg aaaagtgcca cctgacgtc 7469 

<210> 9 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 9 

tgcttaagcg gccgcgaagg agaagtcc 28 

<210> 10 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 10 

ccgagctagc gactgaaaat gag 23 

<210> 11 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 11 

cctctcgaga gacagcaaga cac 23 

<210> 12 
<211> 11152 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 12 

aagcttgggc agaaatggtt gaactcccga gagtgtccta cacctagggg agaagcagcc 60 
aaggggttgt ttcccaccaa ggacgacccg tctgcgcaca aacggatgag cccatcagac 120 
aaagacatat tcattctctg ctgcaaactt ggcatagctc tgctttgcct ggggctattg 180 
ggggaagttg cggttcgtgc tcgcagggct ctcacccttg actcttttaa tagctcttct 240 
gtgcaagatt acaatctaaa caattcggag aactcgacct tcctcctgag gcaaggacca 300 
cagccaactt cctcttacaa gccgcatcga ttttgtcctt cagaaataga aataagaatg 360 
cttgctaaaa attatatttt taccaataag accaatccaa taggtagatt attagttact 420 
atgttaagaa atgaatcatt atcttttagt actattttta ctcaaattca gaagttagaa 480 
atgggaatag aaaatagaaa gagacgctca acctcaattg aagaacaggt gcaaggacta 540 
ttgaccacag gcctagaagt aaaaaaggga aaaaagagtg tttttgtcaa aataggagac 600 
aggtggtggc aaccagggac ttatagggga ccttacatct acagaccaac agatgccccc 660 
ttaccatata caggaagata tgacttaaat tgggataggt gggttacagt caatggctat 720
aaagtgttat atagatccct cccttttcgt gaaagactcg ccagagctag acctccttgg 780
tgtatgttgt ctcaagaaga aaaagacgac atgaaacaac aggtacatga ttatatttat 840
ctaggaacag gaatgcactt ttggggaaag attttccata ccaaggaggg gacagtggct 900 
ggactaatag aacattattc tgcaaaaact catggcatga gttattatga atagccttta 960 
ttggcccaac cttgcggttc ccagggctta agtaagtttt tggttacaaa ctgttcttaa 1020 
aacgaggatg tgagacaagt ggtttcctga cttggtttgg tatcaaaggt tctgatctga 1080 
gctctgagtg ttctattttc ctatgttctt ttggaattta tccaaatctt atgtaaatgc 1140 
ttatgtaaac caagatataa aagagtgctg attttttgag taaacttgca acagtcctaa 1200 
cattcacctc ttgtgtgttt gtgtctgttc gccatcccgt ctccgctcgt cacttatcct 1260 
tcactttcca gagggtcccc ccgcagaccc cggcgaccct caggtcggcc gactgcggca 1320 
gctggcgccc gaacagggac cctcggataa gtgacccttg tctctatttc tactatttgg 1380
tgtttgtctt gtattgtctc tttcttgtct ggctatcatc acaagagcgg aacggactca 1440 
ccatagggac caagctagcg actgaaaatg agacatatta tctgccacgg aggtgttatt 1500 
accgaagaaa tggccgccag tcttttggac cagctgatcg aagaggtact ggctgataat 1560 
cttccacctc ctagccattt tgaaccacct acccttcacg aactgtatga tttagacgtg 1620 
acggcccccg aagatcccaa cgaggaggcg gtttcgcaga tttttcccga ctctgtaatg 1680 
ttggcggtgc aggaagggat tgacttactc acttttccgc cggcgcccgg ttctccggag 1740 
ccgcctcacc tttcccggca gcccgagcag ccggagcaga gagccttggg tccggtttct 1800 
atgccaaacc ttgtaccgga ggtgatcgat cttacctgcc acgaggctgg ctttccaccc 1860 
agtgacgacg aggatgaaga gggtgaggag tttgtgttag attatgtgga gcaccccggg 1920 
cacggttgca ggtcttgtca ttatcaccgg aggaatacgg gggacccaga tattatgtgt 1980 
tcgctttgct atatgaggac ctgtggcatg tttgtctaca gtaagtgaaa attatgggca 2040 
gtgggtgata gagtggtggg tttggtgtgg taattttttt tttaattttt acagttttgt 2100 
ggtttaaaga attttgtatt gtgatttttt taaaaggtcc tgtgtctgaa cctgagcctg 2160 
agcccgagcc agaaccggag cctgcaagac ctacccgccg tcctaaaatg gcgcctgcta 2220 
tcctgagacg cccgacatca cctgtgtcta gagaatgcaa tagtagtacg gatagctgtg 2280
actccggtcc ttctaacaca cctcctgaga tacacccggt ggtcccgctg tgccccatta 2340 
aaccagttgc cgtgagagtt ggtgggcgtc gccaggctgt ggaatgtatc gaggacttgc 2400 
ttaacgagcc tgggcaacct ttggacttga gctgtaaacg ccccaggcca taaggtgtaa 2460 
acctgtgatt gcgtgtgtgg ttaacgcctt tgtttgctga atgagttgat gtaagtttaa 2520 
taaagggtga gataatgttt aacttgcatg gcgtgttaaa tggggcgggg cttaaagggt 2580
atataatgcg ccgtgggcta atcttggtta catctgacct catggaggct tgggagtgtt 2640 
tggaagattt ttctgctgtg cgtaacttgc tggaacagag ctctaacagt acctcttggt 2700 
tttggaggtt tctgtggggc tcatcccagg caaagttagt ctgcagaatt aaggaggatt 2760 
acaagtggga atttgaagag cttttgaaat cctgtggtga gctgtttgat tctttgaatc 2820 
tgggtcacca ggcgcttttc caagagaagg tcatcaagac tttggatttt tccacaccgg 2880
ggcgcgctgc ggctgctgtt gcttttttga gttttataaa ggataaatgg agcgaagaaa 2940 
cccatctgag cggggggtac ctgctggatt ttctggccat gcatctgtgg agagcggttg 3000
tgagacacaa gaatcgcctg ctactgttgt cttccgtccg cccggcgata ataccgacgg 3060 
aggagcagca gcagcagcag gaggaagcca ggcggcggcg gcaggagcag agcccatgga 3120 
acccgagagc cggcctggac cctcgggaat gaatgttgta caggtggctg aactgtatcc 3180 
agaactgaga cgcattttga caattacaga ggatgggcag gggctaaagg gggtaaagag 3240 
ggagcggggg gcttgtgagg ctacagagga ggctaggaat ctagctttta gcttaatgac 3300 
cagacaccgt cctgagtgta ttacttttca acagatcaag gataattgcg ctaatgagct 3360
tgatctgctg gcgcagaagt attccataga gcagctgacc acttactggc tgcagccagg 3420
ggatgatttt gaggaggcta ttagggtata tgcaaaggtg gcacttaggc cagattgcaa 3480
gtacaagatc agcaaacttg taaatatcag gaattgttgc tacatttctg ggaacggggc 3540
cgaggtggag atagatacgg aggatagggt ggcctttaga tgtagcatga taaatatgtg 3600
gccgggggtg cttggcatgg acggggtggt tattatgaat gtaaggttta ctggccccaa 3660
ttttagcggt acggttttcc tggccaatac caaccttatc ctacacggtg taagcttcta 3720
tgggtttaac aatacctgtg tggaagcctg gaccgatgta agggttcggg gctgtgcctt 3780
ttactgctgc tggaaggggg tggtgtgtcg ccccaaaagc agggcttcaa ttaagaaatg 3840
cctctttgaa aggtgtacct tgggtatcct gtctgagggt aactccaggg tgcgccacaa 3900
tgtggcctcc gactgtggtt gcttcatgct agtgaaaagc gtggctgtga ttaagcataa 3960
catggtatgt ggcaactgcg aggacagggc ctctcagatg ctgacctgct cggacggcaa 4020
ctgtcacctg ctgaagacca ttcacgtagc cagccactct cgcaaggcct ggccagtgtt 4080
tgagcataac atactgaccc gctgttcctt gcatttgggt aacaggaggg gggtgttcct 4140
accttaccaa tgcaatttga gtcacactaa gatattgctt gagcccgaga gcatgtccaa 4200
ggtgaacctg aacggggtgt ttgacatgac catgaagatc tggaaggtgc tgaggtacga 4260
tgagacccgc accaggtgca gaccctgcga gtgtggcggt aaacatatta ggaaccagcc 4320
tgtgatgctg gatgtgaccg aggagctgag gcccgatcac ttggtgctgg cctgcacccg 4380
cgctgagttt ggctctagcg atgaagatac agattgaggt actgaaatgt gtgggcgtgg 4440
IH ilif 111 IP iS ™ B
Sil! sis i II I II IPS IIS! B 

££22 SSffi£ S22K S ~ SS£= "IS 

ISS s ™ ? fes " "S i 2 1 1! 

SI E£3S ssssg sssss SS£S 

IS IS ! %"-F ™- zssks 

Ei ~- sate? 

III! 111 Hi IP if ii SSi a 

liiliisiligllilllll 

actccgccca gttccgccca ttctccgcoo catggctgac taattttttt tat?tf^= f?fn 
SIftalS tt^caalla f^^cta ttc^fgafgt agtgaggagg ctt^Sfga £25 
Ilclctttat cccIS£^ gcttggacac aagacaggct tgcgagatat gtttgagllt 6240 
cccgcgtcag ggagaggcag tgcgtaaaaa gacgcggact catqtaaaat etao 
actggttttt agtgcgccag atctctataa tctcgcgcaa cctattttcc cctSc «2n 

sis I Hill ssss isssa ss 

MS EES r™ sasss sees ss 

caggcgggtt Iclataatfct ™ 3 f 9 °f t ?? cact 9ccg ggcgttgttc tttttaactt 6900 
acltalllaa ccagtaagta ttctggaggc tgcatccatg acacaggcaa 6960 

acca!t???= aa ccctgttc aaaccccgct ttaaacatcc tgaaacctcg acgctaitcc 7020 
ctcactSSt ESSE!?? caacc ? cct 9 tgcagtcggc ccttgatgg? aalaccltcc vSsS 
aaatcc?^ 5f?° at9att a accgtctga tgtggatctg gcgcggcatt gacccacgcg 7140 
aaa = Ct ° 9a cgtccaggca cgtattgtga tgagcgatgc cgaacgtacc qacaataltt 7?oo 
tttatfaaaa Illf^ 9 ?? tacc 9 fc 99 c 9 glalctgglt ttatglgtgg Relegate 726? 
taafaft a ?f aac ? tta =" ctgtggtgtg acataattgg acaaactacc tacaglfatt 7320 
at a gttgt a tlft aa ag a t tccaaccta? a 9 t9tataat 9tgttaaact actgftfeta 7 ? 380 

I-!! i il 3FS a9 ™ 9 ™- ~b .ss 
fes22 ™ a99 c - a ™ ssss: ssssa ja.° 

"^" a ^= ? a * aca33 = a tagagtgtct gctattaata actatgetea aaaattftgt 7sSS 
aetl^off " ttaattt 9 taaaggggtt aataaggaat atttgltgta tagtgectta 7860 
cccalacctc cccctoaf^ ataccacatt tgtagaggtt ttacttgltt taSflacct ?£S 
tlttlcloct tataaf 33 ^ t 9 aaa « ataa aa tgaatgca attgttgttg ttaacttgtt 7980 
atth?? 3 ?^ ta t aa tggtt acaaataaag caatagcatc acaaatttca caaataaagc 8040 

ctggatclcc aaaaaact^ f^W? 9 tccaaa <*c atcaatgtat cttatcttft 8?oS 
ccggaccccc aggaagctcc tctgtgtcct cataaaccct aacctcctct acttgagagg 8160
acaaaaagga aattgfgtfg gggtttttla clo!^^ . CCt9ttaatt a 9 3 tcactta 8220
atctgggaag tcccttlcal tfltgtg^tc caaalS^ te * aa 999ta attttaaaat 8 2 fo 
caacagcaga aacatacaag ctgtJalctt tocaoflf^ 99 taaaca 9c ccacaaatgt 8340 
gaagcactgt ggttgctgti ttlgtaltqt a™— 9 " cccaacacc = tgctcatcaa 8400 
tgtaggttcc aaaatatctl gtgtttlcat t?^™ 99 a ?9=acattt tccccacctg 8460 
ctggataagc attatcctta tcfaaaacag ccttltaoff at = a ?? aaCC ca 9cactccl eilo 
caactgtagc attttttggg gttacagttt ala™f£f a ?5 9 " CatC ^tgactgt 8580 
acacaccctg cagctccail ggttccfcac cafeaf™ tt,: 99tcctg tagtttgcta 8640 
gaatgggttt tccagcacca ttttcltgtq ^tttfo^f aaaat 9 aaaa tttgaccctt 8700 
tagcagttac cccaataacc tcagttttaf caatal^ ^ ct 9 aat 9 c aagtttaaca 8760 
oacaggttaa gtcctcattt aaattaggca fao^aatt?? "cccacatc aaaatatttc 8820 
gatacgccta tttttatagg ttaatat^ak £ fc tgaagacgaa agggcctcgt 8880 

cacttttcgg ggaaatgtfl gcgg"a!cccc ^atttctttf ftSFZ* 3 * c 9«~aggtgg 8940 
tatgtatccg ctcatgagac aatfaccctq ataa^ta^? tfc " tc 5 aaa tacattcaaa 9000 
gagtatgagt attcaacltt tccgtgtlg^ cottattccc 9 aaaaa 99^ 9060 

tcctgttttt gctcacccag aaacgctglt afaaot=«f ttttttgcgg cattttgcct 9120 
tgcacgagtg ggttacatcf aactfgatlt laacfa™ ^^^aag atcagttggg 9i 80 
ccccgaagaa ogttttccal tgatlljcf c ■ tttt a! al? t o? 93 ? 0 ?" 9 a 9a9ttttcg 9240 
atcccgtgtt gacgccgggc aagaiclact anlt^ 9 ctgctatgtg gcgcggtatt 9300 
cttggttgag tactcallfg ^cfcfgaaaa SaS^ ^actatt ctcagaatga 9360 
attatgcagt gctgccatal coa^I^ga ?aacact^f I at " C f5 9a ca 9taagaga 9420 
gatcggagga ccgaaggagc taaccqfttt tSS^ff 9 9ccaacttac ttctgacaac 9480 
cottgatcgt tgigaacclg agctgfltaa 11%*? C at 9999gatc atgtaactcg 9540 
gatgcctgca gllltggclf cla^tglg cfaaSatta l^ 9 ^ 9600 
agcttcccgg caacaattaa tagactglat oaaaf^^ act 9?cgaac tacttactct 9660 
gcgctcggcc cttccggctg gctggtttat ta^f*? 9 ^ aaa 9ttgcag gaccacttct 9720 
gtctcgcggt atcatfgWl !ac?ggggcc aqltfotaao ^tggagccg gtgagcgtgg 9780 
ctacacgacg gggagtcagi caactataaa ff«2S ? ccctcccgta tcgtagttat 9840 
tgcctcactg attafgcatt ggtaactftc SatcfaStt a ? a ? a9at ? 9 ct 9 a 9atagg 9 90 o 
tgatttaaaa cttcattttt aatttaaiaa oftot^^ tac tcatata tactttagat 9960 
catgaccaaa atcccttaac gtgagtt^tl Ittcca!^? aa 9 atcct " ttgataatct 10020 
gatcaaagga tcttcttgag ftcctttttt tcto»S 9«9tcagacc ccgtagaaaa 10080 
aaaaccaccg ctaccaglgf tggtttgtS ooclaftfa* atct ? ct 9<=t tgcaaacaaa 10140 
gaaggtaact ggcttcagca galcqcaaafc f^ff=£ * H? gctaccaa ctctttttcc 10200 
gttaggccac clcttcalga Ic^gtljc accgcctala ^t^? 9 ^ ta 9<=cgta 10260 
•gttaccagtg gctgctgcca gtggcgataa atcf^^ 3 taocto 9 c tc tgctaatcct 10320 
atagttaccg gataaggcgc ag^aqtlaaa ^of^ ac =gg9ttgg actcaagacg 10380 
cttggagcga acgacILIa cfgaactHf aScctafff ^ C9t ?° a caca 9clcal 10440 
cacgcttccc gaagggagaa aggcggacaf otafcrr™S <=9tgagctat gagaaagcgc 1OS0O 
agagcgcaog aggllfcftc clgglfgaal gcctglfat c??? 9 ?* 9 ? 9 tc 99 a *cagg 10560 
tcgccacctc tgacttgagc gtcgattttt afcofhfJ?^ ? tttata 9tc ctgtcgggtt 10620 
gaaaaacgcc agcaacgegg clttttfca™ 9 £? at ? CtC9 tca 99g99gc ggagcctatg 10680 
catgttcttt cltgcgttft IlcctJatt? ?=^ 99CC fcttt 9 ct 99<= cttttgctcl 10740 
agctgatacc gctfglcgca |ccgafc|ac clafcac^ ^^^3 cotttfagtg 10800 
ggaagagcgc ctgatgcigt attttc?Ic? tlc%cl^?Z ga 9tcagtga gcgaggaagc 10860 
atggtgcaot ctcagtalla tctgctctS tgollcataf |?° 99tatt t cacaccgclt 10920 
ctatcgctac gtgactgggt eatlgctgla oS^? ttaagccagt atacactccg 10980 
ccctgacggg cttgtctflt ccc?lcatcl gcttfclqlc 112^*°° ^9^4 11040 
agctgcatgt gtcagaggtt ttc^cgtca fcaccgaOc Klcgfglcl |f ct "9gg 11100 

<210> 13 
<211> 19 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 13 

gacggatcgg gagatctcc 19

<210> 14
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 14

ccgcctcaga agccatagag cc 22

<210> 15 
<211> 14455 
<212> DNA 

<213> Artificial Sequence 

<220>

<223> Description of Artificial Sequence: plasmid
<400> 15

aagcttgggc agaaatggtt gaactcccga gagtgtccta cacctagggg agaagcagcc 60
aaggggttgt ttcccaccaa ggacgacccg tctgcgcaca aacggatgag cccatcagac 120
aaagacatat tcattctctg ctgcaaactt ggcatagctc tgctttgcct ggggctattg 180
ggggaagttg cggttcgtgc tcgcagggct ctcacccttg actcttttaa tagctcttct 240
gtgcaagatt acaatctaaa caattcggag aactcgacct tcctcctgag gcaaggacca 300
cagccaactt cctcttacaa gccgcatcga ttttgtcctt cagaaataga aataagaatg 360
cttgctaaaa attatatttt taccaataag accaatccaa taggtagatt attagttact 420
atgttaagaa atgaatcatt atcttttagt actattttta ctcaaattca gaagttagaa 480
atgggaatag aaaatagaaa gagacgctca acctcaattg aagaacaggt gcaaggacta 540
ttgaccacag gcctagaagt aaaaaaggga aaaaagagtg tttttgtcaa aataggagac 600
aggtggtggc aaccagggac ttatagggga ccttacatct acagaccaac agatgccccc 660
ttaccatata caggaagata tgacttaaat tgggataggt gggttacagt caatggctat 720
aaagtgttat atagatccct cccttttcgt gaaagactcg ccagagctag acctccttgg 780
tgtatgttgt ctcaagaaga aaaagacgac atgaaacaac aggtacatga ttatatttat 840
ctaggaacag gaatgcactt ttggggaaag attttccata ccaaggaggg gacagtggct 900
ggactaatag aacattattc tgcaaaaact catggcatga gttattatga atagccttta 960

B iS i si s irs e 

II safe sssss S53SS seas eass 

tgtttgtctt gtattgtctc tttcttgtct ggctatcatc acaagagcgg aacggactca 1440
ccatagggac caagctagcg actgaaaatg agacatatta tctgccacgg aggtgttatt 1500

i sees sess sssa sssss ssss 
i™ ssssss sasa esas saas as aj . 
s£ asfe sasa aaa: saas sasa 

i«! sssass aaas sg IE I s ssg 

SSSSS 88338 S5SSSE aSSS SagS -Sgl >I 
ISSS SSSSS SS33H5 SSSK 13 1 1 

actccggtcc ttctaacaca cctcctgaga tacacccggt ggtcccgctg tgccccatta 2340

arJa ssssE ssss sasa sasa asas
acctgtgatt gcgtgtgtgg ttaacgcctt tgtttgctga atgagttgat gtaagtttaa 2520
taaagggtga gataatgttt aacttgcatg gcgtgttaaa tggggcgggg cttaaagggt 2580
atataatgcg ccgtgggcta atcttggtta catctgacct catggaggct tgggagtgtt 2640
tggaagattt ttctgctgtg cgtaacttgc tggaacagag ctctaacagt acctcttggt 2700
tttggaggtt tctgtggggc tcatcccagg caaagttagt ctgcagaatt aaggaggatt 2760
acaagtggga atttgaagag cttttgaaat cctgtggtga gctgtttgat tctttgaatc 2820
tgggtcacca ggcgcttttc caagagaagg tcatcaagac tttggatttt tccacaccgg 2880
ggcgcgctgc ggctgctgtt gcttttttga gttttataaa ggataaatgg agcgaagaaa 2940
cccatctgag cggggggtac ctgctggatt ttctggccat gcatctgtgg agagcggttg 3000
tgagacacaa gaatcgcctg ctactgttgt cttccgtccg cccggcgata ataccgacgg 3060
aggagcagca gcagcagcag gaggaagcca ggcggcggcg gcaggagcag agcccatgga 3120
acccgagagc cggcctggac cctcgggaat gaatgttgta caggtggctg aactgtatcc 3180
agaactgaga cgcattttga caattacaga ggatgggcag gggctaaagg gggtaaagag 3240
ggagcggggg gcttgtgagg ctacagagga ggctaggaat ctagctttta gcttaatgac 3300
cagacaccgt cctgagtgta ttacttttca acagatcaag gataattgcg ctaatgagct 3360
tgatctgctg gcgcagaagt attccataga gcagctgacc acttactggc tgcagccagg 3420
ggatgatttt gaggaggcta ttagggtata tgcaaaggtg gcacttaggc cagattgcaa 3480
gtacaagatc agcaaacttg taaatatcag gaattgttgc tacatttctg ggaacggggc 3540
cgaggtggag atagatacgg aggatagggt ggcctttaga tgtagcatga taaatatgtg 3600
gccgggggtg cttggcatgg acggggtggt tattatgaat gtaaggttta ctggccccaa 3660
ttttagcggt acggttttcc tggccaatac caaccttatc ctacacggtg taagcttcta 3720
tgggtttaac aatacctgtg tggaagcctg gaccgatgta agggttcggg gctgtgcctt 3780
ttactgctgc tggaaggggg tggtgtgtcg ccccaaaagc agggcttcaa ttaagaaatg 3840
cctctttgaa aggtgtacct tgggtatcct gtctgagggt aactccaggg tgcgccacaa 3900
tgtggcctcc gactgtggtt gcttcatgct agtgaaaagc gtggctgtga ttaagcataa 3960
catggtatgt ggcaactgcg aggacagggc ctctcagatg ctgacctgct cggacggcaa 4020
ctgtcacctg ctgaagacca ttcacgtagc cagccactct cgcaaggcct ggccagtgtt 4080
tgagcataac atactgaccc gctgttcctt gcatttgggt aacaggaggg gggtgttcct 4140
accttaccaa tgcaatttga gtcacactaa gatattgctt gagcccgaga gcatgtccaa 4200
ggtgaacctg aacggggtgt ttgacatgac catgaagatc tggaaggtgc tgaggtacga 4260
tgagacccgc accaggtgca gaccctgcga gtgtggcggt aaacatatta ggaaccagcc 4320
tgtgatgctg gatgtgaccg aggagctgag gcccgatcac ttggtgctgg cctgcacccg 4380
cgctgagttt ggctctagcg atgaagatac agattgaggt actgaaatgt gtgggcgtgg 4440
cttaagggtg ggaaagaata tataaggtgg gggtcttatg tagttttgta tctgttttgc 4500
agcagccgcc gccgccatga gcaccaactc gtttgatgga agcattgtga gctcatattt 4560
gacaacgcgc atgcccccat gggccggggt gcgtcagaat gtgatgggct ccagcattga 4620
tggtcgcccc gtcctgcccg caaactctac taccttgacc tacgagaccg tgtctggaac 4680
gccgttggag actgcagcct ccgccgccgc ttcagccgct gcagccaccg cccgcgggat 4740
tgtgactgac tttgctttcc tgagcccgct tgcaagcagt gcagcttccc gttcatccgc 4800
ccgcgatgac aagttgacgg ctcttttggc acaattggat tctttgaccc gggaacttaa 4860
tgtcgtttct cagcagctgt tggatctgcg ccagcaggtt tctgccctga aggcttcctc 4920
ccctcccaat gcggtttaaa acataaataa aaaaccagac tctgtttgga tttggatcaa 4980
gcaagtgtct tgctgtctct cgagggatct ttgtgaagga accttacttc tgtggtgtga 5040
cataattgga caaactacct acagagattt aaagctctaa ggtaaatata aaatttttaa 5100
gtgtataatg tgttaaacta ctgattctaa ttgtttgtgt attttagatt ccaacctatg 5160
gaactgatga atgggagcag tggtggaatg cctttaatga ggaaaacctg ttttgctcag 5220
aagaaatgcc atctagtgat gatgaggcta ctgctgactc tcaacattct actcctccaa 5280
aaaagaagag aaaggtagaa gaccccaagg actttccttc agaattgcta agttttttga 5340
gtcatgctgt gtttagtaat agaactcttg cttgctttgc tatttacacc acaaaggaaa 5400
aagctgcact gctatacaag aaaattatgg aaaaatattc tgtaaccttt ataagtaggc 5460
ataacagtta taatcataac atactgtttt ttcttactcc acacaggcat agagtgtctg 5520
ctattaataa ctatgctcaa aaattgtgta cctttagctt tttaatttgt aaaggggtta 5580
ataaggaata tttgatgtat agtgccttga ctagagatca taatcagcca taccacattt 5640
gtagaggttt tacttgcttt aaaaaacctc ccacacctcc ccctgaacct gaaacataaa 5700
atgaatgcaa ttgttgttgt taacttgttt attgcagctt ataatggtta caaataaagc 5760
aatagcatca caaatttcac aaataaagca tttttttcac tgcattctag ttgtggtttg 5820
tccaaactca tcaatgtatc ttatcatgtc tggatccggc tgtggaatgt gtgtcagtta 5880
gggtgtggaa agtccccagg ctccccagca ggcagaagta tgcaaagcat gcatctcaat 5940
tagtcagcaa ccaggtgtgg aaagtcccca ggctccccag caggcagaag tatgcaaagc 6000
atgcatctca attagtcagc aaccatagtc ccgcccctaa ctccgcccat cccgccccta 6060
actccgccca gttccgccca ttctccgccc catggctgac taattttttt tatttatgca 6120
gaggccgagg ccgcctcggc ctctgagcta ttccagaagt agtgaggagg cttttttgga 6180







ggcctaggct tttgcaaaaa gcttggacac aagacaggct tgcgagatat gtttgagaat 6240
accactttat cccgcgtcag ggagaggcag tgcgtaaaaa gacgcggact catgtgaaat 6300
actggttttt agtgcgccag atctctataa tctcgcgcaa cctattttcc cctcgaacac 6360
tttttaagcc gtagataaac aggctgggac acttcacatg agcgaaaaat acatcgtcac 6420
ctgggacatg ttgcagatcc atgcacgtaa actcgcaagc cgactgatgc cttctgaaca 6480
atggaaaggc attattgccg taagccgtgg cggtctggta ccgggtgcgt tactggcgcg 6540
tgaactgggt attcgtcatg tcgataccgt ttgtatttcc agctacgatc acgacaacca 6600
gcgcgagctt aaagtgctga aacgcgcaga aggcgatggc gaaggcttca tcgttattga 6660
tgacctggtg gataccggtg gtactgcggt tgcgattcgt gaaatgtatc caaaagcgca 6720
ctttgtcacc atcttcgcaa aaccggctgg tcgtccgctg gttgatgact atgttgttga 6780
tatcccgcaa gatacctgga ttgaacagcc gtgggatatg ggcgtcgtat tcgtcccgcc 6840
aatctccggt cgctaatctt ttcaacgcct ggcactgccg ggcgttgttc tttttaactt 6900
caggcgggtt acaatagttt ccagtaagta ttctggaggc tgcatccatg acacaggcaa 6960
acctgagcga aaccctgttc aaaccccgct ttaaacatcc tgaaacctcg acgctagtcc 7020
gccgctttaa tcacggcgca caaccgcctg tgcagtcggc ccttgatggt aaaaccatcc 7080
ctcactggta tcgcatgatt aaccgtctga tgtggatctg gcgcggcatt gacccacgcg 7140
aaatcctcga cgtccaggca cgtattgtga tgagcgatgc cgaacgtacc gacgatgatt 7200
tatacgatac ggtgattggc taccgtggcg gcaactggat ttatgagtgg gccccggatc 7260
tttgtgaagg aaccttactt ctgtggtgtg acataattgg acaaactacc tacagagatt 7320
taaagctcta aggtaaatat aaaattttta agtgtataat gtgttaaact actgattcta 7380
attgtttgtg tattttagat tccaacctat ggaactgatg aatgggagca gtggtggaat 7440
gcctttaatg aggaaaacct gttttgctca gaagaaatgc catctagtga tgatgaggct 7500
actgctgact ctcaacattc tactcctcca aaaaagaaga gaaaggtaga agaccccaag 7560
gactttcctt cagaattgct aagttttttg agtcatgctg tgtttagtaa tagaactctt 7620
gcttgctttg ctatttacac cacaaaggaa aaagctgcac tgctatacaa gaaaattatg 7680
gaaaaatatt ctgtaacctt tataagtagg cataacagtt ataatcataa catactgttt 7740
tttcttactc cacacaggca tagagtgtct gctattaata actatgctca aaaattgtgt 7800
acctttagct ttttaatttg taaaggggtt aataaggaat atttgatgta tagtgccttg 7860
actagagatc ataatcagcc ataccacatt tgtagaggtt ttacttgctt taaaaaacct 7920
cccacacctc cccctgaacc tgaaacataa aatgaatgca attgttgttg ttaacttgtt 7980
tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 8040
atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 8100
ctggatcccc aggaagctcc tctgtgtcct cataaaccct aacctcctct acttgagagg 8160
acattccaat cataggctgc ccatccaccc tctgtgtcct cctgttaatt aggtcactta 8220
acaaaaagga aattgggtag gggtttttca cagaccgctt tctaagggta attttaaaat 8280
atctgggaag tcccttccac tgctgtgttc cagaagtgtt ggtaaacagc ccacaaatgt 8340
caacagcaga aacatacaag ctgtcagctt tgcacaaggg cccaacaccc tgctcatcaa 8400
gaagcactgt ggttgctgtg ttagtaatgt gcaaaacagg aggcacattt tccccacctg 8460
tgtaggttcc aaaatatcta gtgttttcat ttttacttgg atcaggaacc cagcactcca 8520
ctggataagc attatcctta tccaaaacag ccttgtggtc agtgttcatc tgctgactgt 8580
caactgtagc attttttggg gttacagttt gagcaggata tttggtcctg tagtttgcta 8640
acacaccctg cagctccaaa ggttccccac caacagcaaa aaaatgaaaa tttgaccctt 8700
gaatgggttt tccagcacca ttttcatgag ttttttgtgt ccctgaatgc aagtttaaca 8760
tagcagttac cccaataacc tcagttttaa cagtaacagc ttcccacatc aaaatatttc 8820
cacaggttaa gtcctcattt aaattaggca aaggaattct tgaagacgaa agggcctcgt 8880
gatacgccta tttttatagg ttaatgtcat gataataatg gtttcttaga cgtcaggtgg 8940
cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa 9000
tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa 9060
gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct 9120
tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 9180
tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg 9240
ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt 9300
atcccgtgtt gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga 9360
cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga 9420
attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 9480
gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg 9540
ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac 9600
gatgcctgca gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct 9660
agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct 9720
gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg 9780
gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta tcgtagttat 9840
ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg 9900
tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat 9960
tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct 10020
catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 10080
gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 10140
aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 10200
gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag tgtagccgta 10260
gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct 10320
gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg 10380
atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag 10440
cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc 10500
cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg 10560
agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 10620
tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 10680
gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc cttttgctca 10740
catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg cctttgagtg 10800
agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc 10860
ggaagagcgc ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcat 10920
accgcctcag aagccataga gcccaccgca tccccagcat gcctgctatt gtcttcccaa 10980
tcctccccct tgctgtcctg ccccacccca ccccccagaa tagaatgaca cctactcaga 11040
caatgcgatg caatttcctc attttattag gaaaggacag tgggagtggc accttccagg 11100
gtcaaggaag gcacggggga ggggcaaaca acagatggct ggcaactaga aggcacagtc 11160
gaggctgatc agcgagctct agcatttagg tgacactata gaatagggcc ctctagatgc 11220
atgctcgagc ggccgcttct ttattcttgg gcaatgtatg aaaaagtgta agaggatgtg 11280
gcaaatattt cattaatgta gttgtggcca gaccagtccc atgaaaatga catagagtat 11340
gcacttggag ttgtgtctcc tgtttcctgt gtaccgttta gtgtaatggt tagtgttaca 11400
ggtttagttt tgtctccgtt taagtaaact tgactgacaa tgttactttt ggcagtttta 11460
ccgtgagatt ttggataagc tgataggtta ggcataaatc caacagcgtt tgtataggct 11520
gtgccttcag taagatctcc atttctaaag ttccaatatt ctgggtccag gaaggaattg 11580
tttagtagca ctccattttc gtcaaatctt ataataagat gagcactttg aactgttcca 11640
gatattggag ccaaactgcc tttaacagcc aaaactgaaa ctgtagcaag tatttgactg 11700
ccacattttg ttaagaccaa agtgagttta gcatctttct ctgcatttag tctacagtta 11760
ggagatggag ctggtgtggt ccacaaagtt agcttatcat tatttttgtt tcctactgta 11820
atggcacctg tgctgtcaaa actaaggcca gttcctagtt taggaaccat agccttgttt 11880
gaatcaaatt ctaggccatg gccaattttt gttttgaggg gatttgtgtt tggtgcatta 11940
ggtgaaccaa attcaagccc atctcctgca ttaatggcta tggctgtagc gtcaaacatc 12000
aaccccttgg cagtgcttag gttaacctca agctttttgg aattgtttga agctgtaaac 12060
aagtaaaggc ctttgttgta gttaatatcc aagttgtggg ctgagtttat aaaaagaggg 12120
ccctgtccta gtcttagatt tagttggttt tgagcatcaa acggataact aacatcaagt 12180
ataaggcgtc tgttttgaga atcaatcctt agtcctcctg ctacattaag ttgcatattg 12240
ccttgtgaat caaaacccaa ggctccagta actttagttt gcaaggaagt attattaata 12300
gtcacacctg gaccagttgc tacggtcaaa gtgtttaggt cgtctgttac atgcaaagga 12360
gccccgtact ttagtcctag ttttccattt tgtgtataaa tgggctcttt caagtcaatg 12420
cccaagctac cagtggcagt agttagaggg ggtgaggcag tgatagtaag ggtactgcta 12480
tcggtggtgg tgagggggcc tgatgtttgc agggctagct ttccttctga cactgtgagg 12540
ggtccttggg tggcaatgct aagtttggag tcgtgcacgg ttagcggggc ctgtgattgc 12600
atggtgagtg tgttgcccgc gaccattaga ggtgcggcgg cagccacagt tagggcttct 12660
gaggtaactg tgaggggtgc agatatttcc aggtttatgt ttgacttggt ttttttgaga 12720
ggtgggctca cagtggttac attttgggag gtaaggttgc cggcctcgtc cagagagagg 12780
ccgttgccca ttttgagcgc aagcatgcca ttggaggtaa ctagaggttc ggataggcgc 12840
aaagagagta ccccaggggg actctcttga aacccattgg gggatacaaa gggaggagta 12900
agaaaaggca cagttggagg accggtttcc gtgtcatatg gatacacggg gttgaaggta 12960
tcttcagacg gtcttgcgcg cttcatcttg gatctcaagc ctgccacacc tcacctcgac 13020
catccgccgt ctcaagaccg cctactttaa ttacatcatc agcagcacct ccgccagaaa 13080
caaccccgac cgccacccgc tgccgcccgc cacggtgctc agcctacctt gcgactgtga 13140
ctggttagac gcctttctcg agaggttttc cgatccggtc gatgcggact cgctcaggtc 13200
cctcggtggc ggagtaccgt tcggaggccg acgggtttcc gatccaagag tactggaaag 13260
accgcgaaga gtttgtcctc aaccgcgagc ccaacagcga gctcgaattc agatccgagc 13320
tcggtaccaa gcttgggtct ccctatagtg agtcgtatta atttcgataa gccagtaagc 13380
agtgggttct ctagttagcc agagagctct gcttatatag acctcccacc gtacacgcct 13440
accgcccatt tgcgtcaatg gggcggagtt gttacgacat tttggaaagt cccgttgatt 13500
ttggtgccaa aacaaactcc cattgacgtc aatggggtgg agacttggaa atccccgtga 13560
gtcaaaccgc tatccacgcc cattgatgta ctgccaaaac cgcatcacca tggtaatagc 13620
gatgactaat acgtagatgt actgccaagt aggaaagtcc cataaggtca tgtactgggc 13680
ataatgccag gcgggccatt taccgtcatt gacgtcaata gggggcgtac ttggcatatg 13740
atacacttga tgtactgcca agtgggcagt ttaccgtaaa tagtccaccc attgacgtca 13800
atggaaagtc cctattggcg ttactatggg aacatacgtc attattgacg tcaatgggcg 13860
ggggtcgttg ggcggtcagc caggcgggcc atttaccgta agttatgtaa cgcggaactc 13920
catatatggg ctatgaacta atgaccccgt aattgattac tattaataac tagtcaataa 13980
tcaatgtcaa cgcgtatatc tggcccgtac atcgcgaagc agcgcaaaac gcctaaccct 14040
aagcagattc ttcatgcaat tgtcggtcaa gccttgcctt gttgtagctt aaattttgct 14100
cgcgcactac tcagcgacct ccaacacaca agcagggagc agatactggc ttaactatgc 14160
ggcatcagag cagattgtac tgagagtcga ccatagggga tcgggagatc tcccgatccg 14220
tctatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agtatacact 14280
ccgctatcgc tacgtgactg ggtcatggct gcgccccgac acccgccaac acccgctgac 14340
gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc 14400
gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag gcagc 14455



<210> 16 

<211> 10610 

<212> DNA

<213> Artificial Sequence 



<220>

<223> Description of Artificial Sequence: plasmid



<400> 16 

gacggatcgg 

tctatgtcat 

tcctcttaca 

cgtgtttatt 

accaccacat 

caacctgcca 

aaaaagcatc 

ctgtcgagcc 

catgtcgctg 

cggcgaagga 

gcggtggtgc 

atacaacatg 

tgtcctccgg 

cagcaccaca 

ggggaccaca 

cctcataaac 

ccggtaccat 

ggccaaaacc 

gagagcccag 

acacaggcac 

atcccaggga 

cacgtaactc 

cagtatggta 

gcgccgagac 

agtcatattt 

ctcgccgctt 

ggcgccccct 

ccaccaccgc 

cgggaggagc 

aaaacctcaa 

tctacagcca 

caaacggccc 

ataaacattc 

atatctctaa 

ccctccacct 

cctgtataag 

gcagggccag 



gagatccgcg 
tttcatggga 
ctttttcata 
tttcaattgc 
agcttataca 
cctccctccc 
atatcatggg 
aaacgctcat 
tccagctgct 
gaagtccacg 
tgcagcagcg 
gcagtggtct 
gcacagcagc 
atattgttca 
gaacccacgt 
acgctggaca 
ataaacctct 
tgcccgccgg 
gactcgtaac 
acgtgcatac 
acaacccatt 
acgttgtgca 
gcgcgggttt 
aaccgagatc 
cctgaagcaa 
agatcgctct 
ggcttcgggt 
agaataagcc
gggaagagct 
aatgaagatc 
aagaacagat 
tcacgtccaa 
cagcaccttc 
gcaaatcccg 
tcagcctcaa 
attcaaaagc 
ctgaacataa 



cggtacacag 
ctggtctggc 
cattgcccaa 
agaaaatttc 
gatcaccgta 
aacacacaga 
taacagacat 
cagtgatatt 
gagccacagg 
cctacatggg 
cgcgaataaa 
cctcagcgat 
gcaccctgat 
aaatcccaca 
ggccatcata 
taaacattac 
gattaaacat 
ctatacactg 
catggatcat 
acttcctcag 
cctgaatcag 
ttgtcaaagt 
ctgtctcaaa 
gtgttggtcg 
aaccaggtgc 
gtgtagtagt 
tctatgtaaa 
acacccagcc 
ggaagaacca 
tattaagtga 
aatggcattt 
gtggacgtaa 
aaccatgccc 
aatattaagt 
gcagcgaatc 
ggaacattaa 
tcgtgcaggt 



aattcaggag 
cacaactaca 
gaataaagaa 
aagtcatttt 
ccttaatcaa 
gtacacagtc 
attcttaggt 
aataaactcc 
ctgctgtcca 
ggtagagtca 
ctgctgccgc 
gattcgcacc 
ctcacttaaa 
gtgcaaggcg 
ccacaagcgc 
ctcttttggc 
ggcgccatcc 
cagggaaccg 
catgctcgtc 
gattacaagc 
cgtaaatccc 
gttacattcg 
aggaggtaga 
tagtgtcatg 
gggcgtgaca 
tgtagtatat 
ctccttcatg 
aacctacaca 
tgtttttttt 
acgcgctccc 
gtaagatgtt 
aggctaaacc 
aaataattct 
ccggccattg 
atgattgcaa 
caaaaatacc 
ctgcacggac 



acacaactcc 

ttaatgaaat 

tcgtttgtgt 

tcattcagta 

actcacagaa 

ctttctcccc 

gttatattcc 

ccgggcagct 

acttgcggtt 

taatcgtgca 

cgccgctccg 

gcccgcagca 

tcagcacagt 

ctgtatccaa 

aggtagatta 

atgttgtaat 

accaccatcc 

ggactggaac 

atgatatcaa 

tcctcccgcg 

acactgcagg 

ggcagcagcg 

cgatccctac 

ccaaatggaa 

aacagatctg 

ccactctctc 

cgccgctgcc 

ttcgttctgc 

tttattccaa 

ctccggtggc 

gcacaatggc 

cttcagggtg 

catctcgcca 

taaaaatctg 

aaattcaggt 

gcgatcccgt 

cagcgcggcc 



aagtgcatac 

atttgccaca 

tatgtttcaa 

gtatagcccc 

ccctagtatt 

ggctggcctt 

acacggtttc 

cacttaagtt 

gcttaacggg 

tcaggatagg 

tcctgcagga 

taaggcgcct 

aactgcagca 

agctcatggc 

agtggcgacc 

tcaccacctc 

taaaccagct 

aatgacagtg 

tgttggcaca 

ttagaaccat 

gaagacctcg 

gatgatcctc 

tgtacggagt 

cgccggacgt 

cgtctccggt 

aaagcatcca 

ctgataacat 

gagtcacaca 

aagattatcc 

gtggtcaaac 

ttccaaaagg 

aatctcctct 

ccttctcaat 

ctccagagcg 

tcctcacaga 

aggtcccttc 

acttccccgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220
caggaacctt
ccagcgtagc 
ctcaaaaaat 
agataaaggc 
atgtctgcgg 
aagcctgtct 
cggcgtgacc 
tcatgtccgg 
gctaaaaagc 
acagccccca 
aaaccctcct 
cagcggcagc 
ctcgacacgg 
atatatagga 
cacgcgaacc 
ttccgttttc 
tcagtacaat 
tggaggtcgc 
acaattgcat 
ccagatatac 
cattagttca 
ctggctgacc 
taacgccaat 
acttggcagt 
gtaaatggcc 
agtacatcta 
atgggcgtgg 
atgggagttt 
ccccattgac 
tctggctaac 
gggagaccca 
gttgaggaca 
cgaacggtac 
tctcgagaaa 
gcagcgggtg 
aggcggtctt 
aagcgcgcaa 
ggtcctccaa 
agtccccctg 
cttgcgctca 
aatgtaacca 
tctgcacccc 
gtcgcgggca 
cttagcattg 
tcaggccccc 
actactgcca 
aaactaggac 
gtagcaactg 
gccttgggtt 
gattctcaaa 
ctaaatctaa 
aactacaaca 
aacctaagca 
gatgggcttg 
ggccatggcc 
agttttgaca 
tggaccacac 
actttggtct 
aaaggcagtt 
gacgaaaatg 
aatggagatc 
tcagcttatc 



gacaaaagaa 
cccgatgtaa 
caggcaaagc 
aggtaagctc 
gtttctgcat 
tacaacagga 
gtaaaaaaac 
agtcataatg 
gaccgaaata 
taggaggtat 
gcctaggcaa 
ctaacagtca 
caccagctca 
ctaaaaaatg 
tacgcccaga 
ccacgttacg 
ctgctctgat 
tgagtagtgc 
gaagaatctg 
gcgttgacat 
tagcccatat 
gcccaacgac 
agggactttc 
acatcaagtg 
cgcctggcat 
cgtattagtc 
atagcggttt 
gttttggcac 
gcaaatgggc 
tagagaaccc 
agcttggtac 
aactcttcgc 
tccgccaccg 
ggcgtctaac 
gcggtcgggg 
gagacggcgg 
gaccgtctga 
ctgtgccttt 
gggtactctc 
aaatgggcaa 
ctgtgagccc 
tcacagttac 
acacactcac 
ccacccaagg 
tcaccaccac 
ctggtagctt 
taaagtacgg 
gtccaggtgt 
ttgattcaca 
acagacgcct 
gactaggaca 
aaggccttta 
ctgccaaggg 
aatttggttc 
tagaatttga 
gcacaggtgc 
cagctccatc 
taacaaaatg 
tggctccaat 
gagtgctact 
ttactgaagg 
caaaatctca 



cccacactga 
gctttgttgc 
ctcgcgcaaa 
cggaaccacc 
aaacacaaaa 
aaaacaaccc 
tggtcaccgt 
taagactcgg 
gcccggggga 
aacaaaatta 
aatagcaccc 
gccttaccag 
atcagtcaca 
acgtaacggt 
aacgaaagcc 
taacttcccg 
gccgcatagt 
gcgagcaaaa 
cttagggtta 
tgattattga 
atggagttcc 
ccccgcccat 
cattgacgtc 
tatcatatgc 
tatgcccagt 
atcgctatta 
gactcacggg 
caaaatcaac 
ggtaggcgtg 
actgcttact 
cgagctcgga 
ggtctttcca 
agggacctga 
cagtcacagt 
ttgtttctgg 
atggtcgagg 
agataccttc 
tcttactcct 
tttgcgccta 
cggcctctct 
acctctcaaa 
ctcagaagcc 
catgcaatca 
acccctcaca 
cgatagcagt 
gggcattgac 
ggctcctttg 
gactattaat 
aggcaatatg 
tatacttgat 
gggccctctt 
cttgtttaca 
gttgatgttt 
acctaatgca 
ttcaaacaag 
cattacagta 
tcctaactgt 
tggcagtcaa 
atctggaaca 
aaacaattcc 
cacagcctat 
cggtaaaact 



ttatgacacg 
atgggcggcg 
aaagaaagca 
acagaaaaag 
taaaataaca 
ttataagcat 
gattaaaaag 
taaacacatc 
atacataccc 
ataggagaga 
tcccgctcca 
taaaaaagaa 
gtgtaaaaaa 
taaagtccac 
aaaaaaccca 
gatcctctcc 
taagccagta 
tttaagctac 
ggcgttttgc 
ctagttatta 
gcgttacata 
tgacgtcaat 
aatgggtgga 
caagtacgcc 
acatgacctt 
ccatggtgat 
gatttccaag 
gggactttcc 
tacggtggga 
ggcttatcga 
tctgaattcg 
gtactcttgg 
gcgagtccgc 
cgcaaggtag 
cggaggtgct 
tgaggtgtgg 
aaccccgtgt 
ccctttgtat 
tccgaacctc 
ctggacgagg 
aaaaccaagt 
ctaactgtgg 
caggccccgc 
gtgtcagaag 
acccttacta 
ttgaaagagc 
catgtaacag 
aatacttcct 
caacttaatg 
gttagttatc 
tttataaact 
gcttcaaaca 
gacgctacag 
ccaaacacaa 
gctatggttc 
ggaaacaaaa 
agactaaatg 
atacttgcta 
gttcaaagtg 
ttcctggacc 
acaaacgctg 
gccaaaagta 



catactcgga 
atataaaatg 
catcgtagtc 
acaccatttt 
aaaaaacatt 
aagacggact 
caccaccgac 
aggttgattc 
gcaggcgtag 
aaaacacata 
gaacaacata 
aacctattaa 
gggccaagtg 
aaaaaacacc 
caacttcctc 
cgatccccta 
tctgctccct 
aacaaggcaa 
gctgcttcgc 
atagtaatca 
acttacggta 
aatgacgtat 
ctatttacgg 
ccctattgac 
atgggacttt 
gcggttttgg 
tctccacccc
aaaatgtcgt 
ggtctatata 
aattaatacg 
agctcgctgt 
atcggaaacc 
atcgaccgga 
gctgagcacc 
gctgatgatg 
caggcttgag 
atccatatga 
cccccaatgg 
tagttacctc 
ccggcaacct 
caaacataaa 
ctgccgccgc 
taaccgtgca 
gaaagctagc 
tcactgcctc 
ccatttatac 
acgacctaaa 
tgcaaactaa 
tagcaggagg 
cgtttgatgc 
cagcccacaa 
attccaaaaa 
ccatagccat 
atcccctcaa 
ctaaactagg 
ataatgataa 
cagagaaaga 
cagtttcagt 
ctcatcttat 
cagaatattg 
ttggatttat 
acattgtcag 



gctatgctaa 
caaggtgctg 
atgctcatgc 
tctctcaaac 
taaacattag 
acggccatgc 
agctcctcgg 
atcggtcagt 
agacaacatt 
aacacctgaa 
cagcgcttca 
aaaaacacca 
cagagcgagt 
cagaaaaccg 
aaatcgtcac 
tggtcgactc 
gcttgtgtgt 
ggcttgaccg 
gatgtacggg 
attacggggt 
aatggcccgc 
gttcccatag 
taaactgccc 
gtcaatgacg 
cctacttggc 
cagtacatca 
attgacgtca
aacaactccg 
agcagagctc 
actcactata 
tgggctcgcg 
cgtcggcctc 
tcggaaaacc 
gtggcgggcg 
taattaaagt 
atccaagatg 
cacggaaacc 
gtttcaagag 
caatggcatg 
tacctcccaa 
cctggaaata 
acctctaatg 
cgactccaaa 
cctgcaaaca 
accccctcta 
acaaaatgga 
cactttgacc 
agttactgga 
actaaggatt 
tcaaaaccaa 
cttggatatt 
gcttgaggtt 
taatgcagga 
aacaaaaatt 
aactggcctt 
gctaactttg 
tgctaaactc 
tttggctgtt 
tataagattt 
gaactttaga 
gcctaaccta 
tcaagtttac 



2280 
2340
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940
ttaaacggag
acaggagaca 
aactacatta 
taaagaagcg 
gctagagctc 
ccctcccccg 
aatgaggaaa 
gggcaggaca 
ggctctatgg 
ccctgtagcg 
cttgccagcg 
gccggctttc 
ttacggcacc 
ccctgataga 
ttgttccaaa 
attttgggga 
aattaattct 
ggcagaagta 
ggctccccag 
ccgcccctaa 
catggctgac 
ttccagaagt 
gcttgtatat 
gaacaagatg 
gactgggcac 
gggcgcccgg 
gaggcagcgc 
gttgtcactg 
ctgtcatctc 
ctgcatacgc 
cgagcacgta 
caggggctcg 
gatctcgtcg 
ttttctggat 
ttggctaccc 
ctttacggta 
ttcttctgag 
cacgagattt 
gggacgccgg 
ccaacttgtt 
caaataaagc 
cttatcatgt 
tgtttcctgt 
taaagtgtaa 
cactgcccgc 
gcgcggggag 
tgcgctcggt 
tatccacaga 
ccaggaaccg 
agcatcacaa 
accaggcgtt 
ccggatacct 
gtaggtatct 
ccgttcagcc 
gacacgactt 
taggcggtgc 
tatttggtat 
gatccggcaa 
cgcgcagaaa 
agtggaacga 
cctagatcct 
cttggtctga 



acaaaactaa 
caactccaag 
atgaaatatt 
gccgctcgag 
gctgatcagc 
tgccttcctt 
ttgcatcgca 
gcaaggggga 
cttctgaggc
gcgcattaag 
ccctagcgcc 
cccgtcaagc 
tcgaccccaa 
cggtttttcg 
ctggaacaac 
tttcggccta 
gtggaatgtg 
tgcaaagcat 
caggcagaag 
ctccgcccat 
taattttttt 
agtgaggagg 
ccattttcgg 
gattgcacgc 
aacagacaat 
ttctttttgt 
ggctatcgtg 
aagcgggaag 
accttgctcc 
ttgatccggc 
ctcggatgga 
cgccagccga 
tgacccatgg 
tcatcgactg 
gtgatattgc 
tcgccgctcc 
cgggactctg
cgattccacc 
ctggatgatc 
tattgcagct 
atttttttca 
ctgtataccg 
gtgaaattgt 
agcctggggt 
tttccagtcg 
aggcggtttg 
cgttcggctg 
atcaggggat 
taaaaaggcc 
aaatcgacgc 
tccccctgga 
gtccgccttt 
cagttcggtg 
cgaccgctgc 
atcgccactg 
tacagagttc 
ctgcgctctg 
acaaaccacc 
aaaaggatct 
aaactcacgt 
tttaaattaa 
cagttaccaa 



acctgtaaca 
tgcatactct 
tgccacatcc 
catgcatcta 
ctcgactgtg 
gaccctggaa 
ttgtctgagt
ggattgggaa 
ggaaagaacc 
cgcggcgggt 
cgctcctttc 
tctaaatcgg 
aaaacttgat 
ccctttgacg 
actcaaccct 
ttggttaaaa 
tgtcagttag 
gcatctcaat 
tatgcaaagc 
cccgccccta 
tatttatgca 
cttttttgga 
atctgatcaa 
aggttctccg 
cggctgctct 
caagaccgac 
gctggccacg 
ggactggctg 
tgccgagaaa 
tacctgccca 
agccggtctt 
actgttcgcc 
cgatgcctgc 
tggccggctg 
tgaagagctt 
cgattcgcag 
gggttcgaaa 
gccgccttct 
ctccagcgcg 
tataatggtt 
ctgcattcta 
tcgacctcta 
tatccgctca 
gcctaatgag 
ggaaacctgt 
cgtattgggc 
cggcgagcgg 
aacgcaggaa 
gcgttgctgg 
tcaagtcaga 
agctccctcg 
ctcccttcgg 
taggtcgttc 
gccttatccg 
gcagcagcca 
ttgaagtggt 
ctgaagccag 
gctggtagcg 
caagaagatc 
taagggattt 
aaatgaagtt 
tgcttaatca 



ctaaccatta 
atgtcatttt 
tcttacactt 
gagggcccta 
ccttctagtt 
ggtgccactc 
aggtgtcatt 
gacaatagca 
agctggggct 
gtggtggtta 
gctttcttcc 
ggcatccctt 
tagggtgatg 
ttggagtcca 
atctcggtct 
aatgagctga 
ggtgtggaaa 
tagtcagcaa 
atgcatctca 
actccgccca 
gaggccgagg 
ggcctaggct 
gagacaggat 
gccgcttggg 
gatgccgccg 
ctgtccggtg 
acgggcgttc 
ctattgggcg 
gtatccatca 
ttcgaccacc 
gtcgatcagg 
aggctcaagg 
ttgccgaata 
ggtgtggcgg 
ggcggcgaat 
cgcatcgcct 
tgaccgacca 
atgaaaggtt 
gggatctcat 
acaaataaag 
gttgtggttt 
gctagagctt 
caattccaca 
tgagctaact 
cgtgccagct 
gctcttccgc 
tatcagctca 
agaacatgtg 
cgtttttcca 
ggtggcgaaa 
tgcgctctcc 
gaagcgtggc 
gctccaagct 
gtaactatcg 
ctggtaacag 
ggcctaacta 
ttaccttcgg 
gtggtttttt 
ctttgatctt 
tggtcatgag 
ttaaatcaat 
gtgaggcacc 



cactaaacgg 
catgggactg 
tttcatacat 
ttctatagtg 
gccagccatc 
ccactgtcct 
ctattctggg 
ggcatgctgg 
ctagggggta 
cgcgcagcgt 
cttcctttct 
tagggttccg 
gttcacgtag 
cgttctttaa 
attcttttga 
tttaacaaaa 
gtccccaggc 
ccaggtgtgg 
attagtcagc 
gttccgccca 
ccgcctctgc 
tttgcaaaaa 
gaggatcgtt 
tggagaggct 
tgttccggct 
ccctgaatga 
cttgcgcagc 
aagtgccggg 
tggctgatgc 
aagcgaaaca 
atgatctgga 
cgcgcatgcc 
tcatggtgga
accgctatca 
gggctgaccg 
tctatcgcct 
agcgacgccc 
gggcttcgga 
gctggagttc 
caatagcatc 
gtccaaactc 
ggcgtaatca 
caacatacga 
cacattaatt 
gcattaatga 
ttcctcgctc 
ctcaaaggcg 
agcaaaaggc 
taggctccgc 
cccgacagga 
tgttccgacc 
gctttctcaa 
gggctgtgtg 
tcttgagtcc 
gattagcaga 
cggctacact 
aaaaagagtt 
tgtttgcaag 
ttctacgggg 
attatcaaaa 
ctaaagtata 
tatctcagcg 



tacacaggaa 
gtctggccac 
tgcccaagaa 
tcacctaaat 
tgttgtttgc 
ttcctaataa 
gggtggggtg 
ggatgcggtg 
tccccacgcg 
gaccgctaca 
cgccacgttc 
atttagtgct 
tgggccatcg 
tagtggactc 
tttataaggg 
atttaacgcg 
tccccaggca 
aaagtcccca 
aaccatagtc 
ttctccgccc 
ctctgagcta 
gctcccggga 
tcgcatgatt
attcggctat 
gtcagcgcag 
actgcaggac 
tgtgctcgac 
gcaggatctc 
aatgcggcgg 
tcgcatcgag 
cgaagagcat 
cgacggcgag 
aaatggccgc 
ggacatagcg 
cttcctcgtg 
tcttgacgag 
aacctgccat 
atcgttttcc 
ttcgcccacc 
acaaatttca 
atcaatgtat 
tggtcatagc 
gccggaagca 
gcgttgcgct 
atcggccaac 
actgactcgc 
gtaatacggt 
cagcaaaagg 
ccccctgacg 
ctataaagat 
ctgccgctta 
tgctcacgct 
cacgaacccc 
aacccggtaa 
gcgaggtatg
agaaggacag 
ggtagctctt 
cagcagatta 
tctgacgctc 
aggatcttca 
tatgagtaaa 
atctgtctat 



6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660
ttcgttcatc
taccatctgg 
tatcagcaat 
ccgcctccat 
atagtttgcg 
gtatggcttc 
tgtgcaaaaa 
cagtgttatc 
taagatgctt 
ggcgaccgag 
ctttaaaagt 
cgctgttgag 
ttactttcac 
gaataagggc 
gcatttatca 
aacaaatagg 



catagttgcc 
ccccagtgct 
aaaccagcca 
ccagtctatt 
caacgttgtt 
attcagctcc 
agcggttagc 
actcatggtt 
ttctgtgact 
ttgctcttgc 
gctcatcatt 
atccagttcg 
cagcgtttct 
gacacggaaa 
gggttattgt 
ggttccgcgc 



tgactccccg 
gcaatgatac 
gccggaaggg 
aattgttgcc 
gccattgcta 
ggttcccaac 
tccttcggtc 
atggcagcac 
ggtgagtact 
ccggcgtcaa 
ggaaaacgtt 
atgtaaccca 
gggtgagcaa 
tgttgaatac 
ctcatgagcg 
acatttcccc 



tcgtgtagat 
cgcgagaccc 
ccgagcgcag 
gggaagctag 
caggcatcgt 
gatcaaggcg 
ctccgatcgt 
tgcataattc 
caaccaagtc 
tacgggataa 
cttcggggcg 
ctcgtgcacc 
aaacaggaag 
tcatactctt 
gatacatatt 
gaaaagtgcc 



aactacgata 
acgctcaccg 
aagtggtcct 
agtaagtagt 
ggtgtcacgc 
agttacatga 
tgtcagaagt 
tcttactgtc 
attctgagaa 
taccgcgcca 
aaaactctca 
caactgatct 
gcaaaatgcc 
cctttttcaa 
tgaatgtatt 
acctgacgtc 



cgggagggct 
gctccagatt 
gcaactttat 
tcgccagtta 
tcgtcgtttg 
tcccccatgt 
aagttggccg 
atgccatccg 
tagtgtatgc 
catagcagaa 
aggatcttac 
tcagcatctt 
gcaaaaaagg 
tattattgaa 
tagaaaaata 



9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10610 



<210> 17 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220>

<223> Description of Artificial Sequence: Primer 

<400> 17
tgtacaccgg atccggcgca cacc 24



<210> 18 
<211> 35 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 18 

cacaacgagc tcaattaatt aattgccaca tcctc 

<210> 19 

<211> 4 

<212> PRT 

<213> adenovirus 

<400> 19 
Thr Leu Trp Thr 
1 

<210> 20 

<211> 12 

<212> PRT 

<213> adenovirus 



<400> 20

Pro Ser Ala Ser Ala Ser Ala Ser Ala Pro Gly Ser
1               5                   10

<210> 22 
<211> 327 
<212> DNA
<213> adenovirus



<400> 22 

agatctgaat tcgagctcgc tgttgggctc gcggttgagg acaaactctt cgcggtcttt 60
ccagtactct tggatcggaa acccgtcggc ctccgaacgg tactccgcca ccgagggacc 120
tgagcgagtc cgcatcgacc ggatcggaaa acctctcgag aaaggcgtct aaccagtcac 180
agtcgcaagg taggctgagc accgtggcgg gcggcagcgg gtggcggtcg gggttgtttc 240
tggcggaggt gctgctgatg atgtaattaa agtaggcggt cttgagacgg cggatggtcg 300
aggtgaggtg tggcaggctt gagatct 327

<210> 23
<211> 32480
<212> DNA
<213> adenovirus

<400> 23
catcatcaat 
ttgtgacgtg 
gatgttgcaa 
gtgtgcgccg 
taaatttggg 
agtgaaatct 
aagcttgaat 
tatgcttccg 
cagctatgac 
aggaacttgg 
agggtgtgga 
ttagtcagca 
catgcatctc 
aactccgccc 
agaggccgag 
aggcctaggc 
tcgaacactt 
atcgtcacct 
tctgaacaat 
gaccagaaac 
ggcgagatcg 
cttaatcgcc
accgatcgcc 
ccggcaccag 
gtcgtcgtcc 
acctatccca 
tcgctcacat 
gatggcgtta 
gacagtcgtt 
ctcgcggtga 
cggatgagcg 
gatttccatg 
gttcagatgt 
gaaacgcagg 
ggttatgccg 
gaaatcccga 
gaagcagaag 
ctgaacggca 
ggtcaggtca 
tttaacgccg 
cgctacggcc 
aatcgtctga 
gtgcagcgcg 
cacggcgcta 
gtgcagtatg 
gcgcgcgtgg 



aatatacctt 
gcgcggggcg 
gtgtggcgga 

gtgtacacag 

cgtaaccgag 

gaataatttt 

tcgattaatg 

gctcgtatgt 

catgattacg 

ttaggtacct 

aagtccccag 

accaggtgtg 

aattagtcag 

agttccgccc 

gccgcctcgg 

ttttgcaaaa 

tttaagccgt 

gggacatgtt 

ggaaaggcat 

agcacctcga 

atcccgtcgt 

ttgcagcaca 

cttcccaaca 

aagcggtgcc 

cctcaaactg 

ttacggtcaa 

ttaatgttga 

actcggcgtt 

tgccgtctga 

tggtgctgcg 

gcattttccg

ttgccactcg 

gcggcgagtt 

tcgccagcgg 

atcgcgtcac 

atctctatcg 

cctgcgatgt 

agccgttgct 

tggatgagca 

tgcgctgttc 

tgtatgtggt 

ccgatgatcc 

atcgtaatca 

atcacgacgc 

aaggcggcgg 

atgaagacca 



attttggatt 

tgggaacggg 

acacatgtaa 

gaagtgacaa 

taagatttgg 

gtgttactca 

tgagttagct 

tgtgtggaat 

aattcggcgc 

tctgaggcgg 

gctccccagc 

gaaagtcccc 

caaccatagt 

attctccgcc 

cctctgagct 

agcttgggat 

agataaacag 

gcagatccat 

tattgccgta 

actgagccgc 

tttacaacgt 

tccccctttc 

gttgcgcagc 

ggaaagctgg 

gcagatgcac 

tccgccgttt 

tgaaagctgg 

tcatctgtgg 

atttgacctg 

ttggagtgac 

tgacgtctcg 

ctttaatgat 

gcgtgactac 

caccgcgcct 

actacgtctg 

tgcggtggtt 

cggtttccgc 

gattcgaggc 

gacgatggtg 

gcattatccg 

ggatgaagcc 

gcgctggcta 

cccgagtgtg 

gctgtatcgc 

agccgacacc 

gcccttcccg 



gaagccaata 

gcgggtgacg 

gcgacggatg 

ttttcgcgcg 

ccattttcgc 

tagcgcgtaa 

cactcattag 

tgtgagcgga 

agcaccatgg 

aaagaaccag 

aggcagaagt 

aggctcccca 

cccgccccta 

ccatggctga 

attccagaag 

ctctataatc 

gctgggacac 

gcacgtaaac 

agccgtggcg 

gatattgccc 

cgtgactggg 

gccagctggc 

ctgaatggcg 

ctggagtgcg 

ggttacgatg 

gttcccacgg 

ctacaggaag 

tgcaacgggc 

agcgcatttt 

ggcagttatc 

ttgctgcata 

gatttcagcc 

ctacgggtaa 

ttcggcggtg 

aacgtcgaaa 

gaactgcaca 

gaggtgcgga 

gttaaccgtc 

caggatatcc 

aaccatccgc 

aatattgaaa 

ccggcgatga 

atcatctggt 

tggatcaaat 

acggccaccg 

gctgtgccga 



tgataatgag 

tagtagtgtg 

tggcaaaagt 

gttttaggcg 

gggaaaactg 

tctctagcat 

gcaccccagg 

taacaatttc 

cctgaaataa 

ctgtggaatg 

atgcaaagca 

gcaggcagaa 

actccgccca 

ctaatttttt 

tagtgaggag 

tcgcgcaacc 

ttcacatgag 

tcgcaagccg 

gtctggtacc 

agcgtttcaa 

aaaaccctgg 

gtaatagcga 

aatggcgctt 

atcttcctga 

cgcccatcta 

agaatccgac 

gccagacgcg 

gctgggtcgg 

tacgcgccgg 

tggaagatca 

aaccgactac 

gcgctgtact 

cagtttcttt 

aaattatcga 

acccgaaact 

ccgccgacgg 

ttgaaaatgg 

acgagcatca 

tgctgatgaa 

tgtggtacac 

cccacggcat 

gcgaacgcgt 

cgctggggaa 

ctgtcgatcc 

atattatttg 

aatggtccat 



ggggtggagt 

gcggaagtgt 

gacgtttttg 

gatgttgtag 

aataagagga 

cgatgtcgac 

ctttacactt 

acacaggaaa 

cctctgaaag 

tgtgtcagtt 

tgcatctcaa 

gtatgcaaag 

tcccgcccct 

ttatttatgc 

gcttttttgg 

tattttcccc 

cgaaaaatac 

actgatgcct 

ggtgggtgaa 

cgcgctgtat 

cgttacccaa 

agaggcccgc 

tgcctggttt 

ggccgatact 

caccaacgta 

gggttgttac 

aattattttt 

ttacggccag 

agaaaaccgc 

ggatatgtgg 

acaaatcagc 

ggaggctgaa 

atggcagggt 

tgagcgtggt 

gtggagcgcc 

cacgctgatt 

tctgctgctg 

tcctctgcat 

gcagaacaac 

gctgtgcgac 

ggtgccaatg 

aacgcgaatg 

tgaatcaggc 

ttcccgcccg 

cccgatgtac 

caaaaaatgg 



60

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760

ctttcgctac ctggagagac gcgcccgctg atcctttgcg aatacgccca cgcgatgggt 2820
aacagtcttg gcggtttcgc taaatactgg caggcgtttc gtcagtatcc ccgtttacag 2880
ggcggcttcg tctgggactg ggtggatcag tcgctgatta aatatgatga aaacggcaac 2940
ccgtggtcgg cttacggcgg tgattttggc gatacgccga acgatcgcca gttctgtatg 3000
aacggtctgg tctttgccga ccgcacgccg catccagcgc tgacggaagc aaaacaccag 3060
cagcagtttt tccagttccg tttatccggg caaaccatcg aagtgaccag cgaatacctg 3120
ttccgtcata gcgataacga gctcctgcac tggatggtgg cgctggatgg taagccgctg 3180
gcaagcggtg aagtgcctct ggatgtcgct ccacaaggta aacagttgat tgaactgcct 3240
gaactaccgc agccggagag cgccgggcaa ctctggctca cagtacgcgt agtgcaaccg 3300
aacgcgaccg catggtcaga agccgggcac atcagcgcct ggcagcagtg gcgtctggcg 3360
gaaaacctca gtgtgacgct ccccgccgcg tcccacgcca tcccgcatct gaccaccagc 3420
gaaatggatt tttgcatcga gctgggtaat aagcgttggc aatttaaccg ccagtcaggc 3480
tttctttcac agatgtggat tggcgataaa aaacaactgc tgacgccgct gcgcgatcag 3540
ttcacccgtg caccgctgga taacgacatt ggcgtaagtg aagcgacccg cattgaccct 3600
aacgcctggg tcgaacgctg gaaggcggcg ggccattacc aggccgaagc agcgttgttg 3660
cagtgcacgg cagatacact tgctgatgcg gtgctgatta cgaccgctca cgcgtggcag 3720
catcagggga aaaccttatt tatcagccgg aaaacctacc ggattgatgg tagtggtcaa 3780
atggcgatta ccgttgatgt tgaagtggcg agcgatacac cgcatccggc gcggattggc 3840
ctgaactgcc agctggcgca ggtagcagag cgggtaaact ggctcggatt agggccgcaa 3900
gaaaactatc ccgaccgcct tactgccgcc tgttttgacc gctgggatct gccattgtca 3960
gacatgtata ccccgtacgt cttcccgagc gaaaacggtc tgcgctgcgg gacgcgcgaa 4020
ttgaattatg gcccacacca gtggcgcggc gacttccagt tcaacatcag ccgctacagt 4080
caacagcaac tgatggaaac cagccatcgc catctgctgc acgcggaaga aggcacatgg 4140
ctgaatatcg acggtttcca tatggggatt ggtggcgacg actcctggag cccgtcagta 4200
tcggcggaat tccagctgag cgccggtcgc taccattacc agttggtctg gtgtcaaaaa 4260
taataataac cgggcaggcc atgtctgccc gtatttcgcg taaggaaatc cattatgtac 4320
tatttaaaaa acacaaactt ttggatgttc ggtttattct ttttctttta cttttttatc 4380
atgggagcct acttcccgtt tttcccgatt tggctacatg acatcaacca tatcagcaaa 4440
agtgatacgg gtattatttt tgccgctatt tctctgttct cgctattatt ccaaccgctg 4500
tttggtctgc tttctgacaa actcggaact tgtttattgc agcttataat ggttacaaat 4560
aaagcaatag catcacaaat ttcacaaata aagcattttt ttcactgcat tctagttgtg 4620
gtttgtccaa actcatcaat gtatcttatc atgtctggat ccagatctgg gcgtggctta 4680
agggtgggaa agaatatata aggtgggggt cttatgtagt tttgtatctg ttttgcagca 4740
gccgccgccg ccatgagcac caactcgttt gatggaagca ttgtgagctc atatttgaca 4800
acgcgcatgc ccccatgggc cggggtgcgt cagaatgtga tgggctccag cattgatggt 4860
cgccccgtcc tgcccgcaaa ctctactacc ttgacctacg agaccgtgtc tggaacgccg 4920
ttggagactg cagcctccgc cgccgcttca gccgctgcag ccaccgcccg cgggattgtg 4980
actgactttg ctttcctgag cccgcttgca agcagtgcag cttcccgttc atccgcccgc 5040
gatgacaagt tgacggctct tttggcacaa ttggattctt tgacccggga acttaatgtc 5100
gtttctcagc agctgttgga tctgcgccag caggtttctg ccctgaaggc ttcctcccct 5160
cccaatgcgg tttaaaacat aaataaaaaa ccagactctg tttggatttg gatcaagcaa 5220
gtgtcttgct gtctttattt aggggttttg cgcgcgcggt aggcccggga ccagcggtct 5280
cggtcgttga gggtcctgtg tattttttcc aggacgtggt aaaggtgact ctggatgttc 5340
agatacatgg gcataagccc gtctctgggg tggaggtagc accactgcag agcttcatgc 5400
tgcggggtgg tgttgtagat gatccagtcg tagcaggagc gctgggcgtg gtgcctaaaa 5460
atgtctttca gtagcaagct gattgccagg ggcaggccct tggtgtaagt gtttacaaag 5520
cggttaagct gggatgggtg catacgtggg gatatgagat gcatcttgga ctgtattttt 5580
aggttggcta tgttcccagc catatccctc cggggattca tgttgtgcag aaccaccagc 5640
acagtgtatc cggtgcactt gggaaatttg tcatgtagct tagaaggaaa tgcgtggaag 5700
aacttggaga cgcccttgtg acctccaaga ttttccatgc attcgtccat aatgatggca 5760
atgggcccac gggcggcggc ctgggcgaag atatttctgg gatcactaac gtcatagttg 5820
tgttccagga tgagatcgtc ataggccatt tttacaaagc gcgggcggag ggtgccagac 5880
tgcggtataa tggttccatc cggcccaggg gcgtagttac cctcacagat ttgcatttcc 5940
cacgctttga gttcagatgg ggggatcatg tctacctgcg gggcgatgaa gaaaacggtt 6000
tccggggtag gggagatcag ctgggaagaa agcaggttcc tgagcagctg cgacttaccg 6060
cagccggtgg gcccgtaaat cacacctatt accgggtgca actggtagtt aagagagctg 6120
cagctgccgt catccctgag caggggggcc acttcgttaa gcatgtccct gactcgcatg 6180
ttttccctga ccaaatccgc cagaaggcgc tcgccgccca gcgatagcag ttcttgcaag 6240
gaagcaaagt ttttcaacgg tttgagaccg tccgccgtag gcatgctttt gagcgtttga 6300
ccaagcagtt ccaggcggtc ccacagctcg gtcacctgct ctacggcatc tcgatccagc 6360
atatctcctc gtttcgcggg ttggggcggc tttcgctgta cggcagtagt cggtgctcgt 6420
ccagacgggc cagggtcatg tctttccacg ggcgcagggt cctcgtcagc gtagtctggg 6480



WO 01/83729 PCT/EP01/04863 




tcacggtgaa 
tgctggtgct 
tggtgtcata 
aggcgccgca 
ccgattccgg 
gccaggtgag 
gtttcttacc 
tgtccccgta 
atagaaactc 
agtgggaggg 
acatgtcgcc 
cgggtgttcc 
ccgcatcgct 
tgacttctgc 
ccgcggtgat 
tgtcaagctt 
gcagggtttg 
attcgcgcgc 
gcacgcgcca 
gtaggcgctc 
ggtctagctg 
gcgcgtcgaa 
cggcaagcgc 
cggaggcgta 
atgtagggta 
agggagcgag 
tctgcctgaa 
tggcgtctgt 
tgaccagctc 
tgtcatactt 
ggtctttcca 
tgtagaactg 
cctgcgcggc 
tgaggtactg 
ccgtgcgctt 
ttcccgcgcg 
tgttaattac 
tgtaaagttc 
aggtgagctc 
ggttggaagc 
gaaaggtcct 
gcgggtcttg 
ctagaggctc 
aggcccccat 
gatgcgagcc 
tgtggtgaaa 
gtgcgcagta 
cgcgcacaag 
cttctacttc 
ggaccaccac 
tgacaacatc 
gcgggagctc 
gatacctaat 
cccgcggcgc 
atgcatctaa 
cgggagaggg 
taggttgctg 
gaagacgacg 
gtcgttgacg 
gatctcggcc 
cacggtggcg 
tccctcgttc 



ggggtgcgct 

gaagcgctgc 
gtccagcccc 
cgaggggcag 
ggagtaggca 
ctctggccgt 
tctggtttcc 
tacagacttg 
ggaccactct 
gtagcggtcg 
ctcttcggca 
tgaagggggg 
gtctgcgagg 
gctaagattg 
gcctttgagg 
ggtggcaaac 
gtttttgtcg 
aacgcaccgc 
accgcggttg 
gttggtccag 
cgtctcgtcc 
gtagtctatc 
gcgctcgtat 
catgccgcaa 
gcatcttcca 
gaggtcggga 
gatggcatgt 
gagacctacc 
ggcggtgacc 
atcctgtccc 
gtactcttgg 
gttgacggcc 
cttccggagc 
gtatttgaag 
tttggaacgc 
aggcataaag 
ctgggcggcg 
caagaagcgc 
ttcaggggag 
gacgaatgag 
aaactggcga 
ttcccagcgg 
atctccgccg 
ccaagtatag 
gatcgggaag 
gtagaagtcc 
ctggcagcgg 
gaagcagagt 
ggctgcttgt 
gccgcgcgag 
gcgcagatgg 
ctgcaggttt 
ttccaggggc 
gactacggta 
aagcggtgac 
ggcaggggca 
gcgaacgcga 
ggcccggtga 
gcggcctggc 
atgaactgct 
gcgaggtcgt 
cagacgcggc 



ccgggctgcg 
cggtcttcgc 
tccgcggcgt 
tgcagacttt 
tccgcgccgc 
tcggggtcaa 
atgagccggt 
agaggcctgt 
gagacaaagg 
ttgtccacta 
tcaaggaagg 
ctataaaagg 
gccagctgtt 
tcagtttcca 
gtggccgcat 
gacccgtaga 
cgatcggcgc 
cattcgggaa 
tgcagggtga 
cagaggcggc 

ggggggtctg 

ttgcatcctt 
gggttgagtg 
atgtcgtaaa 
ccgcggatgc 
ccgaggttgc 
gagttggatg 
gcgtcacgca 
tgcacgtcta 
ttttttttcc 
atcggaaacc 
tggtaggcgc 
gaggtgtggg 
tcagtgtcgt 
ggatttggca 
ttgcgtgtga 
agcacgatct 
gggatgccct 
ctgagcccgt 
ctccacaggt 
cctatggcca 
tcccatccaa 
aacttcatga 
gtctctacat 
aactggatct 
ctgcgacggg 
tgcacgggct 
gggaatttga 
ccttgaccgt 
cccaaagtcc 
gagctgtcca 
acctcgcata 
tggttggtgg 
ccgcgcggcg 
gcgggcgagc 
cgtcggcgcc 
cgacgcggcg 
gcttgagcct 
gcaaaatctc 
cgatctcttc 
tggaaatgcg 
tgtagaccac 



cgctggccag 
cctgcgcgtc 
ggcccttggc 
tgagggcgta 
aggccccgca 
aaaccaggtt 
gtccacgctc 
cctcgagcgg 
ctcgcgtcca 
gggggtccac 
tgattggttt 
gggtgggggc 
ggggtgagta 
aaaacgagga 
ccatctggtc 
gggcgttgga 
gctccttggc 
agacggtggt 
caaggtcaac 
cgcccttgcg 
cgtccacggt 
gcaagtctag 
ggggacccca 
cgtagagggg 
tggcgcgcac 
tacgggcggg 
atatggttgg 
cgaaggaggc 
gggcgcagta 
acagctcgcg 
cgtcggcctc 
agcatccctt 
tgagcgcaaa 
cgcatccgcc 
gggcgaaggt 
tgcggaaggg 
cgtcaaagcc 
tgatggaagg 
gctctgaaag 
cacgggccat 
ttttttctgg 
ggttcgcggc 
ccagcatgaa 
cgtaggtgac 
cccgccacca 
ccgaacactc 
gtacatcctg 
gcccctcgcc 
ctggctgctc 
agatgtccgc 
tggtctggag 
gacgggtcag 
cggcgtcgat 
ggcggtgggc 
ccccggaggt 
gcgcgcgggc 
gttgatctcc 
gaaagagagt 
ctgcacgtct 
ctcctggaga 
ggccatgagc
gcccccttcg 



ggtgcgcttg 
ggccaggtag
gcgcagcttg 
gagcttgggc 
gacggtctcg
tcccccatgc 
ggtgacgaaa 
tgttccgcgg 
ggccagcacg 
tcgctccagg 
gtaggtgtag 
gcgttcgtcc 
ctccctctga 
ggatttgata 
agaaaagaca 
cagcaacttg 
cgcgatgttt 
gcgctcgtcg 
gctggtggct 
cgagcagaat 
aaagaccccg 
cgcctgctgc 
tggcatgggg 
ctctctgagt 
gtaatcgtat
ctgctctgct 
acgctggaag 
gtaggagtcg 
gtccagggtt 
gttgaggaca 
cgaacggtaa
ttctacgggt 
ggtgtccctg 
ctgctcccag 
gacatcgttg 
tcccggcacc 
gttgatgttg 
caatttttta 
ggcccagtct 
tagcatttgc 
ggtgatgcag 
taggtctcgc
gggcacgagc 
aaagagacgc
attggaggag 
gtgctggctt 
cacgaggttg 
tggcgggttt 
gaggggagtt 
gcgcggcggt 
ctcccgcggc 
ggcgcgggct
ggcttgcaag 
cgcgggggtg 
agggggggct 
aggagctggt 
tgaatctggc 
tcgacagaat 
cctgagttgt 
tctccgcgtc 
tgcgagaagg
gcatcgcggg



aggctggtcc 
catttgacca 
cccttggagg 
gcgagaaata 
cattccacga 
tttttgatgc 
aggctgtccg 
tcctcctcgt 
aaggaggcta 
gtgtgaagac 
gccacgtgac 
tcactctctt 
aaagcgggca
ttcacctggc 
atctttttgt 
gcgatggagc 
agctgcacgt
ggcaccaggt 
acctctccgc 
ggcggtaggg 
ggcagcaggc 
catgcgcggg
tgggtgagcg 
attccaagat 
agttcgtgcg 
cggaagacta
acgttgaagc
cgcagcttgt 
tccttgatga 
aactcttcgc 
gagcctagca
agcgcgtatg
accatgactt 
agcaaaaagt 
aagagtatct 
tcggaacggt
tggcccacaa 
agttcctcgt 
gcaagatgag 
aggtggtcgc 
tagaaggtaa 
gcggcagtca
tgcttcccaa 
tcggtgcgag
tggctattga 
ttgtaaaaac 
acctgacgac 
ggctggtggt 
acggtggatc 
cggagcttga
gtcaggtcag 
agatccaggt
aggccgcatc 
tccttggatg 
ccggacccgc 
gctgcgcgcg 
gcctctgcgt 
caatttcggt
cttgataggc 
cggctcgctc 
cgttgaggcc 
cgcgcatgac
cacctgcgcg
aaagaggtag 
tcgcaacgtg 
gtccacggcg 
aagacggatg 
ttcttcttct 
tgggggaggg 
gatcatctcc 
gcgcagttgg 
cggcagggat 
gagggacctg 
ccagtcacag 
gttgtttctg 
gatggtcgac 
catgccccag 
ttctaccggc 
ggcggcggcg 
gcccctcatc 
ctgcacctgc 
cgtgttgatg 
ctgcgagagc 
gcaagtccgc 
gggccagcgt 
tccgtagatg 
gtcgcggacg 
ctggccggtc 
agcgggcact 
gggttcgagc 
aacccaggtg 
ggcggctgct 
gaaagcgaaa 
agtcgcggga 
ctccccgtca 
ttgcttttcc 
agagcaagag 
ggcgacatcc 
ccggcactac 
tgagcggtac 
gaacctgttt 
cgcagggcgc 
tgagcccgac 
ggtaaccgca 
ccacgtgcgt 
ctttgtaagc 
tatagtgcag 
gcccgagggc 
gcgcagcttg 
caagttttac 
gatcgagggg 
cgtttatcgc 
cgaccgcgag 
agaggccgag 
cctggaggca 
cggcggcgtg 
agcggtgatg 
ctgcagagcc 
atcatgtcgc 
ctctccgcaa 
ctggcgatcg 
gtctacgacg 
ctggaccggc 
cagggcaacc 



agattgagct 
ttgagggtgg 
gattcgttga 
aagttgaaaa 
agctcggcga 
tcaatctcct 
gggacacggc 
ccgcggcgac 
aagacgccgc 
acggcgctaa 
agcgagtccg 
tcgcaaggta 
gcggaggtgc 
agaagcacca 
gcttcgtttt 
acttcttctt 
gagtttggcc 
ggctgaagca 
gtgagggtag 
gtgtaagtgc 
tcggtgtacc 
accaggtact 
agggtggccg 
tacctggaca 
cggttccaga 
aggcgcgcgc 
cttccgtggt 
cccgtatccg 
tgcgacgtca 
gcgctagctt 
gcattaagtg 
cccccggttc 
tgcaagaccc 
cagatgcatc 
cagcggcaga 
gcggttgacg 
ctggacttgg 
ccaagggtgc 
cgcgaccgcg 
gagctgcggc 
gcgcgaaccg 
tacgagcaga 
acgcttgtgg 
gcgctggagc 
cacagcaggg 
cgctggctgc 
agcctggctg 
gcccgcaaga 
ttctacatgc 
aacgagcgca 
ctgatgcaca 
tcctactttg 
gctggggccg 
gaggaatatg 
tttctgatca 
agccgtccgg 
tgactgcgcg 
ttctggaagc 
taaacgcgct 
cgctgcttca 
tggtggggga 
tgggctccat 



ccacgtgccg 
tggcggtgtg 
tatcccccaa 
actgggagtt 
cagtgtcgcg 
cttccataag 
ggcgacgacg 
ggcgcatggt 
ccgtcatgtc 
cgatgcatct 
catcgaccgg 
ggctgagcac 
tgctgatgat 
tgtccttggg 
gacatcggcg 
ctccttcctc 
gtaggtggcg 
gggctaggtc 
actggaagtc 
agttggccat 
tgagacgcga 
ggtatcccac 
gggctccggg 
tccaggtgat 
tgttgcgcag 
aatcgttgac 
ctggtggata 
gccgtccgcc 
gacaacgggg 
ttttggccac 
gctcgctccc 
gagtctcgga 
cgcttgcaaa 
cggtgctgcg 
catgcagggc 
cggcagcaga 
aggagggcga 
agctgaagcg 
agggagagga 
atggcctgaa 
ggattagtcc 
cggtgaacca 
cgcgcgagga 
aaaacccaaa 
acaacgaggc 
tcgatttgat 
acaaggtggc 
tataccatac 
gcatggcgct 
tccacaaggc 
gcctgcaaag 
acgcgggcgc 
gacctgggct
acgaggacga 
gatgatgcaa 
ccttaactcc 
caatcctgac 
ggtggtcccg 
ggccgaaaac 
gcgcgtggct 
tgtgcgcgag 
ggttgcacta 



ggcgaagacg 
ttctgccacg 
ggcctcaagg 
gcgcgccgac 
cacctcgcgc 
ggcctcccct 
gcgcaccggg 
ctcggtgacg 
ccggttatgg 
caacaattgt 
atcggaaaac 
cgtggcgggc 
gtaattaaag 
tccggcctgc 
caggtctttg 
ttgtcctgca 
ccctcttcct 
ggcgacaacg 
atccatgtcc 
aacggaccag 
gtaagccctc 
caaaaagtgc 
ggcgagatct 
gccggcggcg 
cggcaaaaag 
gctctagacc 
aattcgcaag 
gtgatccatg 
gagtgctcct 
tggccgcgcg 
tgtagccgga 
ccggccggac 
ttcctccgga 
gcagatgcgc 
accctcccct 
tggtgattac 
gggcctggcg 
tgatacgcgt 
gcccgaggag 
tcgcgagcgg 
cgcgcgcgca 
ggagattaac 
ggtggctata 
tagcaagccg 
attcagggat 
aaacatcctg 
cgccatcaac 
cccttacgtt 
gaaggtgctt 
cgtgagcgtg 
ggccctggct 
tgacctgcgc 
ggcggtggca 
tgagtacgag 
gacgcaacgg 
acggacgact 
gcgttccggc 
gcgcgcgcaa 
agggccatcc
cgttacaaca 
gccgtggcgc 
aacgccttcc 



gcgtagtttc 
aagaagtaca 
cgctccatgg 
acggttaact 
tcaaaggcta 
tcttcttctt 
aggcggtcga 
gcgcggccgt 
gttggcgggg 
tgtgtaggta 
ctctcgagaa 
ggcagcgggc 
taggcggtct 
tgaatgcgca 
tagtagtctt 
tctcttgcat 
cccatgcgtg 
cgctcggcta 
acaaagcggt 
ttaacggtct 
gagtcaaata 
ggcggcggct 
tccaacataa 

gtggtggagg 

tgctccatgg 
gtgcaaaagg 
ggtatcatgg 
cggttaccgc 
tttggcttcc 
cagcgtaagc 
gggttatttt 
tgcggcgaac 
aacagggacg 
ccccctcctc 
cctcctaccg 
gaacccccgc 
cggctaggag 
gaggcgtacg 
atgcgggatc 
ttgctgcgcg 
cacgtggcgg 
tttcaaaaaa 
ggactgatgc 
ctcatggcgc 
gcgctgctaa 
cagagcatag 
tattccatgc 
cccatagaca 
accttgagcg 
agccggcggc 
ggcacgggca 
tgggccccaa 
cccgcgcgcg 
ccagaggacg 
acccggcggt 
ggcgccaggt 
agcagccgca 
accccacgca 
ggcccgacga 
gcggcaacgt 
agcgtgagcg 
tgagtacaca 



gcaggcgctg 
taacccagcg 
cctcgtagaa 
cctcctccag 
caggggcctc 
ctggcggcgg 
caaagcgctc 
tctcgcgggg 
ggctgccatg 
ctccgccgcc 
aggcgtctaa 
ggcggtcggg 
tgagacggcg 
ggcggtcggc 
gcatgagcct 
ctatcgctgc 
tgaccccgaa 
atatggcctg 
ggtatgcgcc 
ggtgacccgg 
cgtagtcgtt 
ggcggtagag 
ggcgatgata 
cgcgcggaaa 

tcgggacgct 

agagcctgta 
cggacgaccg 
ccgcgtgtcg 
ttccaggcgc 
ggttaggctg 
ccaagggttg 

gggggtttgc 

agcccctttt 
agcagcggca 
cgtcaggagg 
ggcgccgggc 
cgccctctcc 
tgccgcggca 
gaaagttcca 
aggaggactt 
ccgccgacct 
gctttaacaa 
atctgtggga 
agctgttcct 
acatagtaga 
tggtgcagga 
ttagcctggg 
aggaggtaaa 
acgacctggg 
gcgagctcag 
gcggcgatag 
gccgacgcgc 
ctggcaacgt 
gcgagtacta 
gcgggcggcg 
catggaccgc 
ggccaaccgg 
cgagaaggtg 
ggccggcctg 
gcagaccaac 
cgcgcagcag 
gcccgccaac 
gtgccgcggg 
gagacaccgc 
caaggcctgc 
gtgcgggctc 
ctgttgctgc 
ctaggtcact 
actttccagg 
gaggcaaccc 
ttaaacagcg 
atgcgcgacg 
ggcatgtatg 
gccgccgtga 
cctggtttct 
gacatagacg 
gagcaggcag 
ctaggcgctg 
cttaccagca 
tcgctgctgc 
gagagcctag 
ccaggcccgc 
gaggacgatg 
tttgcgcacc 
aataaaaaac 
tgcggcgcgc 
cggcgccagt 
tgcctccgcg 
cacccctatt 
ccctgaacta 
acagcccggg 
gcgacctgaa 
ataagtttaa 
tgaaatacga 
tagaccttat 
ttctggaaag 
ccgtcactgg 
ttttgctgcc 
tccgcaagcg 
gtaacattcc 
aacagggcgg 
ccaacgcggc 
gcgacacctt 
ctgccgcccc 
ccctgacaga 
cccagtaccg 
catggaccct 
tgccagacat 
cggtggtggg 
tctactccca 
agaaccagat 
ctgctctcac 
tgaccattac 
tctcgccgcg 
ccagcaataa 
agcgctccga 
acaaacgcgg 
aggcgcgcaa 
ccgtggtgcg 
gtcgccaccg 
gcgcacgtcg 
ttgtcactgt 
gtgctatgac 
tgcgcgtgcc 



gacaggagga 
aaagtzgaggt 
agaccgtaaa 
ccacaggcga 
tgctaatagc 
tgctgacact 
agattacaag 
taaactacct 
aggaggagcg 
gggtaacgcc 
cctcaaaccg 
accccgagta 
acaccggggg 
acagcgtgtt 
aggcggcgct 
cggccccgcg 
ctcgcaccac 
agccgcagcg 
tggacaagat 
gcccgcccac 
actcggcaga 
ttcgccccag 
tcaccaaggc 
ggcgatgtat 
ggcggcggcg 
gtacctgcgg 
cgacaccacc 
ccagaacgac 
ggaggcaagc 
aaccatcctg 
ggcgcgggtg 

gtgggtggag 

gaacaacgcg 
cgacatcggg 
tcttgtcatg 
aggatgcggg 
gcaacccttc 
cgcactgttg 

gggtggcgca 

agccgcggca 
tgccacacgg 
cgctgcgcaa 
ggacagcaag 
cagctggtac 
gctttgcact 
gatgcaagac 
cgccgagctg 
actcatccgc 
tttggcgcgc 
agatcacggg 
tgacgccaga 
cgtcctatcg 
cacaggctgg 
ccaacaccca 
ccgcactggg 
ctacacgccc 
cggagcccgg 
ccgccgaccc 
caccggccga 
gccccccagg 
tcagggtcgc 
cgtgcgcacc 



ctacaccaac 
gtaccagtct 
cctgagccag 
ccgcgcgacc 
gcccttcacg 
gtaccgcgag 
tgtcagccgc 
gctgaccaac 
cattttgcgc 
cagcgtggcg 
gccgtttatc 
tttcaccaat 
attcgaggtg 
ttccccgcaa 
gcgaaaggaa 
gtcagatgct 
ccgcccgcgc 
cgaaaaaaac 
gagtagatgg 
ccgtcgtcaa 
cgacagcagc 
gctggggaga 
catggcaccg 
gaggaaggtc 
ctgggttctc 
cctaccgggg 
cgtgtgtacc 
cacagcaact 
acacagacca 
cataccaaca 
atggtgtcgc 
ttcacgctgc 
atcgtggagc 
gtaaagtttg 
cctggggtat 
gtggacttca 
caggagggct 
gatgtggacg 
ggcggcagca 
atgcagccgg 
gctgaggaga 
cccgaggtcg 
aaacgcagtt 
cttgcataca 
cctgacgtaa 
cccgtgacct 
ttgcccgtgc 
cagtttacct 
ccgccagccc 
acgctaccgc 
cgccgcacct 
agccgcactt 
ggcctgcgct 
gtgcgcgtgc 
cgcaccaccg 
acgccgccac 
cgctatgcta 
ggcactgccg 
cgggcggcca 
tccaggcgac 
aggggcaacg 
cgccccccgc 



tttgtgagcg 
gggccagact 
gctttcaaaa 
gtgtctagct 
gacagtggca 
gccataggtc 
gcgctggggc 
cggcggcaga 
tacgtgcagc 
ctggacatga 
aaccgcctaa 
gccatcttga 
cccgagggta 
ccgcagaccc 
agcttccgca 
agtagcccat 
ctgctgggcg 
ctgcctccgg 
aagacgtacg 
aggcacgacc 
gtcctggatt 
atgttttaaa 
agcgttggtt 
ctcctccctc 
ccttcgatgc 
ggagaaacag 
tggtggacaa 

ttctgaccac 
tcaatcttga 
tgccaaatgt 
gcttgcctac 
ccgagggcaa 
actacttgaa 
acacccgcaa 
atacaaacga 
cccacagccg 
ttaggatcac 
cctaccaggc 
acagcagtgg 
tggaggacat 
agcgcgctga 
agaagcctca 
acaacctaat 
actacggcga 
cctgcggctc 
tccgctccac 
actccaagag 
ctctgaccca 
ccaccatcac 
tgcgcaacag 
gcccctacgt 
tttgagcaag 
tcccaagcaa 
gcgggcacta 
tcgatgacgc 
cagtgtccac 
aaatgaagag 
cccaacgcgc 
tgcgggccgc 
gagcggccgc 
tgtattgggt 
gcaactagat 



cactgcggct 
attttttcca 
acttgcaggg 
tgctgacgcc 
gcgtgtcccg 
aggcgcatgt 
aggaggacac 
agatcccctc 
agagcgtgag 
ccgcgcgcaa 
tggactactt 
acccgcactg 
acgatggatt 
tgctagagtt 
ggccaagcag 
ttccaagctt 
aggaggagta 
catttcccaa 
cgcaggagca 
gtcagcgggg 
tgggagggag 
aaaaaaaaag 
ttcttgtatt 
ctacgagagt 
tcccctggac 
catccgttac 
caagtcaacg 
ggtcattcaa 
cgaccggtcg 
gaacgagttc 
taaggacaat 
ctactccgag 
agtgggcaga 
cttcagactg 
agccttccat 
cctgagcaac 
ctacgatgat 
gagcttgaaa 
cagcggcgcg 
gaacgatcat 
ggccgaagca 
gaagaaaccg 
aagcaatgac 
ccctcagacc 
ggagcaggtc 
gcgccagatc 
cttctacaac 
cgtgttcaat 
caccgtcagt 
catcggagga 
ttacaaggcc 
catgtccatc 
gatgtttggc 
ccgcgcgccc 
catcgacgcg 
agtggacgcg 
acggcggagg 
ggcggcggcc 
tcgaaggctg 
cgcagcagcc 
gcgcgactcg 
tgcaagaaaa 



aatggtgact 
gaccagtaga 
gctgtggggg 
caactcgcgc 
ggacacatac 
ggacgagcat 
gggcagcctg 
gttgcacagt 
ccttaacctg 
catggaaccg 
gcatcgcgcg 
gctaccgccc 
cctctgggac 
gcaacagcgc 
cttgtccgat 
gatagggtct 
cctaaacaac 
caacgggata 
cagggacgtg 
tctggtgtgg 
tggcaacccg 
catgatgcaa 
ccccttagta 
gtggtgagcg 
ccgccgtttg 
tctgagttgg 
gatgtggcat 
aacaatgact 
cactggggcg 
atgtttacca 
caggtggagc 
accatgacca 
cagaacgggg 
gggtttgacc 
ccagacatca 
ttgttgggca 
ctggagggtg 
gatgacaccg 
gaagagaact 
gccattcgcg 
gcggccgaag 
gtgatcaaac 
agcaccttca 
ggaatccgct 
tactggtcgt 
agcaactttc 
gaccaggccg 
cgctttcccg 
gaaaacgttc 
gtccagcgag 
ctgggcatag 
cttatatcgc 
ggggccaaga 

tggggcgcgc 

gtggtggagg 
gccattcaga 
cgcgtagcac 
ctgcttaacc 
gccgcgggta 
gcggccatta 
gttagcggcc 
aactacttag 
actcgtactg 
aaatcaaaga 
aagagcagga 
atgatgaact 
agtggaaagg 
ccggtgagcg 
acctgcttga 
acatgctggc 
tgcagcaggt 
ctggtgactt 
tcttggaaaa 
agcaggtggc 
gcaccagrtat 
cggtggcgga 
tgcaaacgga 
agtacggcgc 
cccccggcta 
ccaccactgg 
tgcgcagggt 
ccagcatcgt 
tccgtttccc 
acggcctgac 
gcatgcgcgg 
tgcccggaat 
tgtggaaaaa 
tgtagaatgg 
atgggaaact 
tcgctgtgga 
tggaacagca 
aaggtggtag 
gtgcaaaata 
gccgtggaga 
gaaactctgg 
ctgcccacca 
gtaacgctgg 
accgccgttg 
cgatcgttgc 
ctgggggtgc 
tgtcatgtat 
ccaagatggc 
acgcctcgga 
tcagcctgaa 
accggtccca 
cgtacaaggc 
cgtactttga 
ctgcctacaa 
ctactgctct 
agcaagctga 
ttacaaagga 
catttcaacc 
cagctgggag 
aacccacaaa 
aaagtcaagt 
acttgactcc 
atatttctta 
ctatgcccaa 
acaacagcac 
tagatttgca 
atagaaccag 
gaattattga 
gtgtgattaa 
gggaaaaaga 



ttgtatgtat 
agagatgctc 
ttacaagccc 
tgacgacgag 
tcgacgcgta 
ctccacccgc 
gcaggccaac 
gttgccgctg 
gctgcccgcg 
ggcacccacc 
aatgaccgtg 
gccgggactg 
tgccaccgcc 
tgccgcggtg 
cccgtggatg 
cgccagcgcg 
tcgtggctac 
aacccgccgc 
ggctcgcgaa 
ttaaaagccg 
ggtgccggga 
gggcggcatg 
cggtatcctg 
tgcatccgtg 
tcaaaataaa 
aagacatcaa 
ggcaagatat 
gcggcattaa 
gcacaggcca 
atggcctggc 
agattaacag 
cagtgtctcc 
tgacgcaaat 
cccgtcccat 
acctgcctcc 
ttgtaacccg 
ggcccgtagc 
aatccctgaa 
gcgtccatgt 
taccccttcg 
gtacctgagc 
taacaagttt 
gcgtttgacg 
gcggttcacc 
catccgcggc 
cgccctggct 
tgaaataaac 
gcagcaaaaa 
gggtattcaa 
tgaacctcaa 
agtccttaaa 
tgaaaatgga 
ggaaatgcaa 
taaagtggta 
catgcccact 
caggcctaat 
gggtaatatg 
agacagaaac 
gtacttttct 
aaatcatgga 
tacagagact 
tgctacagaa 



ccagcggcgg 
caggtcatcg 
cgaaagctaa 
gtggaactgc 
aaacgtgttt 
acctacaagc 
gagcgcctcg 
gacgagggca 
cttgcaccgt 
gtgcagctga 
gaacctgggc 
ggcgtgcaga 
acagagggca 
caggcggtcg 
tttcgcgttt 
ctactgcccg 
acctaccgcc 
cgccgtcgcc 
ggaggcagga 
gtctttgtgg 
ttccgaggaa 
cgtcgtgcgc 
cccctcctta 
gccttgcagg 
aagtctggac 
ctttgcgtct 
cggcaccagc 
aaatttcggt 
gatgctgagg 
ctctggcatt 
taagcttgat 
agaggggcgt 
agacgagcct 
cgcgcccatg 
ccccgccgac 
tcctagccgc 
cagtggcaac 
gcgccgacga 
cgccgccaga 
atgatgccgc 
cccgggctgg 
agaaacccca 
ctgcggttca 
ctagctgtgg 
gtgctggaca 
cccaagggtg 
ctagaagaag 
actcacgtat 
ataggtgtcg 
ataggagaat 
aagactaccc 
gggcaaggca 
tttttctcaa 
ttgtacagtg 
attaaggaag 
tacattgctt 
ggtgttctgg 
acagagcttt 
atgtggaatc 
actgaagatg 
cttaccaagg 
ttttcagata 



cggcgcgcaa 
cgccggagat 
agcgggtcaa 
tgcacgctac 
tgcgacccgg 
gcgtgtatga 
gggagtttgc 
acccaacacc 



ccgaagaaaa 
tggtacccaa 
tggagcccga 
ccgtggacgt 
tggagacaca 
ctgcggccgc 
cagccccccg 
aatatgccct 
ccagaagacg 
gtcgccagcc 
ccctggtgct 
ttcttgcaga 
gaatgcaccg 
accaccggcg 
ttccactgat 
cgcagagaca 
tctcacgctc 
ctggccccgc 
aatatgagcg 
tccaccgtta 
gataagttga 
agcggggtgg 
ccccgccctc 
ggcgaaaagc 
ccctcgtacg 
gctaccggag 
acccagcaga 
gcgtccctgc 
tggcaaagca 
tgcttctgaa 
ggagctgctg 
agtggtctta 
tgcagtttgc 
cggtggcgcc 
tccctgtgga 
gtgataaccg 
ggggccctac 
ccccaaatcc 
aggacgatga 
ttgggcaggc 
aaggtcaaac 
ctcagtggta 
caatgaaacc 
ttcttgtaaa 
ctactgaggc 
aagatgtaga 
gtaactcacg 
ttagggacaa 
cgggccaagc 
cataccagct 
aggctgttga 
aacttccaaa 
taaaacctaa 
aaaatgaaat 



cgaagctatg 
ctatggcccc 
aaagaaaaag 
cgcgcccagg 
caccaccgta 
tgaggtgtac 
ctacggaaag 
tagcctaaag 
gcgcggccta 
gcgccagcga 
ggtccgcgtg 
tcagataccc 
aacgtccccg 
gtccaagacc 
gcgcccgcgc 
acatccttcc 
agcaactacc 
cgtgctggcc 
gccaacagcg 
tatggccctc 
taggaggggc 
gcggcgcgcg 
cgccgcggcg 
ctgattaaaa 
gcttggtcct 
gacacggctc 
gtggcgcctt 
agaactatgg 
aagagcaaaa 
tggacctggc 
ccgtagagga 
gtccgcgccc 
aggaggcact 
tgctgggcca 
aacctgtgct 
gccgcgccgc 
cactgaacag 
tagctaacgt 
agccgccgcg 
catgcacatc 
ccgcgccacc 
tacgcacgac 
ccgtgaggat 
tgtgctggac 
ttttaagccc 
ttgcgaatgg 
caacgaagac 
gccttattct 
acctaaatat 
cgaaactgaa 
atgttacggt 
gcaacaaaat 
gaccgcaggc 
tatagaaacc 
agaactaatg 
ttttattggt 
atcgcagttg 
tttgcttgat 
cagctatgat 
ttactgcttt 
aacaggtcag 
aagagttgga 



tccaagcgca 
ccgaagaagg 
aaagatgatg 
cgacgggtac 
gtctttacgc 
ggcgacgagg 
cggcataagg 
cccgtaacac 
aagcgcgagt 
ctggaagatg 
cggccaatca 
actaccagta 
gttgcctcag 
tctacggagg 
ggttcgagga 
attgcgccta 
cgacgccgaa 
ccgatttccg 
cgctaccacc 
acctgccgcc 
atggccggcc 
tcgcaccgtc 
attggcgccg 
acaagttgca 
gtaactattt 
gcgcccgttc 
cagctggggc 
cagcaaggcc 
tttccaacaa 
caaccaggca 
gcctccaccg 
cgacagggaa 
aaagcaaggc 
gcacacaccc 
gccaggcccg 
cagcggtccg 
catcgtgggt 
gtcgtatgtg 
cgcccgcttt 
tcgggccagg 
gagacgtact 
gtgaccacag 
actgcgtact 
atggcttcca 
tactctggca 
gatgaagctg 
gaagtagacg 
ggtataaata 
gccgataaaa 
attaatcatg 
tcatatgcaa 
ggaaagctag 
aatggtgata 
ccagacactc 
ggccaacaat 
ctaatgtatt 
aatgctgttg 
tccattggtg 
ccagatgtta 
ccactgggag 
gaaaatggat 
aataattttg 
ccatggaaat 
tgtatttgcc 
acacctacga 
accttggagc 
gcaatgctgg 
acatccaggt 
acacctacga 
atgacctaag 
tcttccccat 
acgaccagtc 
acgctaccaa 
ccttcacgcg 
acacctactc 
agaaggtggc 
cccccaacga 
gtaacatgac 
agggcttcta 
agcccatgag 
tcctacacca 
gacaggccta 
ttacccagaa 
ttatgtccat 
acgcgctaga 
tgtttgaagt 
tgtacctgcg 
aacaacagct 
tggttgtggg 
acacaagctc 
gatggccttt 
ttctgaccag 
cgccattgct 
ggggcccaac 
ctggccccaa 
ctccatgctc 
cagcttcctg 
cacttctttt 
aggcaaatgc 
cgccgtttaa 
gttgcgatac 
ggtgaagttt 
cgatatcttg 
gttgcagcac 
ggagatcaga 
tagctgcctt 
catcaaaagg 
gatctgctta 
gccggaaaac 
ggagatctgc 
ctccttcagc 
atttatcata 
cagccacaac 
caggtacgcc 
cagctgcaac 
cacttggtca 
catcagcgcg 
cgggttcatc 
ccgcatacca 
tttgccatgc 
ttctctttct 
agaagggcgc 
ccgcgggctg 
ctcgatacgc 



caatctaaat 
cgacaagcta 
ctacatgaac 
acgctggtcc 
cctgcgctac 
gcctcagaag 
gtggaacttc 
ggttgacgga 
ggcccacaac 
ctttaacgac 
cgtgcccata 
ccttaagact 
tggctctata 
cattaccttt 
gtttgaaatt 
caaagactgg 
tatcccagag 
ccgtcaggtg 
acacaacaac 
ccctgctaac 
aaagtttctt 
gggcgcactc 
catgactttt 
ctttgacgtg 
cacgcccttc 
gccgccatgg 
ccatattttt 
gcctgcgcca 
gcctggaacc 
cgactcaagc 
tcttcccccg 
tcggccgcct 
actcccatgg 
aacagtcccc 
gagcgccact 
tgtcacttga 
ttttatttgt 
aaatcaaagg 
tggtgtttag 
tcactccaca 
aagtcgcagt 
tggaacacta 
tccgcgtcca 
cccaaaaagg 
tgaccgtgcc 
aaagccacct 
tgattggccg 
accacatttc 
gcgcgctgcc 
atgcttccgt 
gcgcagcccg 
tgcaggaatc 
ccgcggtgct 
ggcagtagtt 
cgcgcagcct 
accgtaattt 
cgcgccactg 
ttgattagca 
tcctcgctgt 
ttctctttct 
ggtgtgcgcg 
cgcctcatcc 



gccaacctgt 
aagtacagtc 
aagcgagtgg 
cttgactata 
cgctcaatgt 
ttctttgcca 
aggaaggatg 
gccagcatta 
accgcctcca 
tatctctccg 
tccatcccct 
aaggaaaccc 
ccctacctag 
gactcttctg 
aagcgctcag 
ttcctggtac 
agctacaagg 
gtggatgata 
tctggatttg 
ttcccctatc 
tgcgatcgca 
acagacctgg 
gaggtggatc 
gtccgtgtgc 
tcggccggca 
gctccagtga 
tgggcaccta 
tagtcaatac 
cgcactcaaa 
aggtttacca 
accgctgtat 
gtggactatt 
atcacaaccc 
aggtacagcc 
cgccctactt 
aaaacatgta 
acactctcgg 
ggttctgccg 
tgctccactt 
ggctgcgcac 
tggggcctcc 
tcagcgccgg 
ggtcctccgc 
gcgcgtgccc 
cggtctgggc 
gagcctttgc 
gacaggccgc 
ggccccaccg 
cgttttcgct 
gtagacactt 
tgggctcgtg 
gccccatcat 
cctcgttcag 
tgaagttcgc 
ccatgccctt 
cactttccgc 
ggtcgtcttc 
ccggtgggtt 
ccacgattac 
tcttgggcgc 
gcaccagcgc 
gcttttttgg 



ggagaaattt 
cttccaacgt 
tggctcccgg 
tggacaacgt 
tgctgggcaa 
ttaaaaacct 
ttaacatggt 
agtttgatag 
cgcttgaggc 
ccgccaacat 
cccgcaactg 
catcactggg 
atggaacctt 
tcagctggcc 
ttgacgggga 
aaatgctagc 
accgcatgta 
ctaaatacaa 
ttggctacct 
cgcttatagg 
ccctttggcg 
gccaaaacct 
ccatggacga 
accggccgca 
acgccacaac 
gcaggaactg 
tgacaagcgc 
ggccggtcgc 
aacatgctac 
gtttgagtac 
aacgctggaa 
ctgctgcatg 
caccatgaac 
caccctgcgt 
ccgcagccac 
aaaataatgt 
gtgattattt 
cgcatcgcta 
aaactcaggc 
catcaccaac 
gccctgcgcg 
gtggtgcacg 
gttgctcagg 
aggctttgag 
gttaggatac 
gccttcagag 
gtcgtgcacg 
gttcttcacg 
cgtcacatcc 
aagctcgcct 
atgcttgtag 
cgtcacaaag 
ccaggtcttg 
ctttagatcg 
ctcccacgca 
ttcgctgggc 
attcagccgc 
gctgaaaccc 
ctctggtgat 
aatggccaaa 
gtcttgtgat 
gggcgcccgg 



cctgtactcc 
aaaaatttct 
gttagtggac 
caacccattt 
tggtcgctat 
ccttctcctg 
tctgcagagc 
catttgcctt 
catgcttaga 
gctctaccct 

ggcggctttc 

ctcgggctac 
ttacctcaac 
tggcaatgac 
gggttacaac 
taactacaac 
ctccttcttt 
ggactaccaa 
tgcccccacc 
caagaccgca 
catcccattc 
tctctacgcc 
gcccaccctt 
ccgcggcgtc 
ataaagaagc 
aaagccattg 
tttccaggct 
gagactgggg 
ctctttgagc 
gagtcactcc 
aagtccaccc 
tttctccacg 
cttattaccg 
cgcaaccagg 
agtgcgcaga 
actagagaca 
acccccaccc 
tgcgccactg 
acaaccatcc 
gcgtttagca 
cgcgagttgc 
ctggccagca 
gcgaacggag 
ttgcactcgc 
agcgcctgca 
aagaacatgc 
cagcaccttg 
atcttggcct 
atttcaatca 
tcgatctcag 
gtcacctctg 
gtcttgttgc 
catacggccg 
ttatccacgt 
gacacgatcg 
tcttcctctt 
cgcactgtgc 
accatttgta 
ggcgggcgct 
tccgccgccg 
gagtcttcct 
ggaggcggcg 



aacatagcgc 
gataacccaa 
tgctacatta 
aaccaccacc 
gtgcccttcc 
ccgggctcat 
tccctaggaa 
tacgccacct 
aacgacacca 
atacccgcca 
cgcggctggg 
gacccttatt 
cacaccttta 
cgcctgctta 
gttgcccagt 
attggctacc 
agaaacttcc 
caggtgggca 
atgcgcgaag 
gttgacagca 
tccagtaact 
aactccgccc 
ctttatgttt 
atcgaaaccg 
aagcaacatc 
tcaaagatct 
ttgtttctcc 
gcgtacactg 
cctttggctt 
tgcgccgtag 
aaagcgtaca 
cctttgccaa 
gggtacccaa 
aacagctcta 
ttaggagcgc 
ctttcaataa 
ttgccgtctg 
gcagggacac 
gcggcagctc 
ggtcgggcgc 
gatacacagg 
cgctcttgtc 
tcaactttgg 
accgtagtgg 
taaaagcctt 
cgcaagactt 
cgtcggtgtt 
tgctagactg 
cgtgctcctt 
cgcagcggtg 
caaacgactg 
tggtgaaggt 
ccagagcttc 
ggtacttgtc 
gcacactcag 
cctcttgcgt 
gcttacctcc 
gcgccacatc 
cgggcttggg 
aggtcgatgg 
cgtcctcgga 
gcgacgggga 
cggggacgac acgtcctcca tggttggggg acgtcgcgcc gcaccgcgtc cgcgctcggg 25140 
ggtggtttcg cgctgctcct cttcccgact ggccatttcc ttctcctata ggcagaaaaa 25200 
gatcatggag tcagtcgaga agaaggacag cctaaccgcc ccctctgagt tcgccaccac 25260 
cgcctccacc gatgccgcca acgcgcctac caccttcccc gtcgaggcac ccccgcttga 25320 
ggaggaggaa gtgattatcg agcaggaccc aggttttgta agcgaagacg acgaggaccg 25380 
ctcagtacca acagaggata aaaagcaaga ccaggacaac gcagaggcaa acgaggaaca 25440 
agtcgggcgg ggggacgaaa ggcatggcga ctacctagat gtgggagacg acgtgctgtt 25500 
gaagcatctg cagcgccagt gcgccattat ctgcgacgcg ttgcaagagc gcagcgatgt 25560 
gcccctcgcc atagcggatg tcagccttgc ctacgaacgc cacctattct caccgcgcgt 25620 
accccccaaa cgccaagaaa acggcacatg cgagcccaac ccgcgcctca acttctaccc 25680 
cgtatttgcc gtgccagagg tgcttgccac ctatcacatc tttttccaaa actgcaagat 25740 
acccctatcc tgccgtgcca accgcagccg agcggacaag cagctggcct tgcggcaggg 25800 
cgctgtcata cctgatatcg cctcgctcaa cgaagtgcca aaaatctttg agggtcttgg 25860 
acgcgacgag aagcgcgcgg caaacgctct gcaacaggaa aacagcgaaa atgaaagtca 25920 
ctctggagtg ttggtggaac tcgagggtga caacgcgcgc ctagccgtac taaaacgcag 25980 
catcgaggtc acccactttg cctacccggc acttaaccta ccccccaagg tcatgagcac 26040 
agtcatgagt gagctgatcg tgcgccgtgc gcagcccctg gagagggatg caaatttgca 26100 
agaacaaaca gaggagggcc tacccgcagt tggcgacgag cagctagcgc gctggcttca 26160 
aacgcgcgag cctgccgact tggaggagcg acgcaaacta atgatggccg cagtgctcgt 26220 
taccgtggag cttgagtgca tgcagcggtt ctttgctgac ccggagatgc agcgcaagct 26280 
agaggaaaca ttgcactaca cctttcgaca gggctacgta cgccaggcct gcaagatctc 26340 
caacgtggag ctctgcaacc tggtctccta ccttggaatt ttgcacgaaa accgccttgg 26400 
gcaaaacgtg cttcattcca cgctcaaggg cgaggcgcgc cgcgactacg tccgcgactg 26460 
cgtttactta tttctatgct acacctggca gacggccatg ggcgtttggc agcagtgctt 26520 
ggaggagtgc aacctcaagg agctgcagaa actgctaaag caaaacttga aggacctatg 26580 
gacggccttc aacgagcgct ccgtggccgc gcacctggcg gacatcattt tccccgaacg 26640 
cctgcttaaa accctgcaac agggtctgcc agacttcacc agtcaaagca tgttgcagaa 26700 
ctttaggaac tttatcctag agcgctcagg aatcttgccc gccacctgct gtgcacttcc 26760 
tagcgacttt gtgcccatta agtaccgcga atgccctccg ccgctttggg gccactgcta 26820 
ccttctgcag ctagccaact accttgccta ccactctgac ataatggaag acgtgagcgg 26880 
tgacggtcta ctggagtgtc actgtcgctg caacctatgc accccgcacc gctccctggt 26940 
ttgcaattcg cagctgctta acgaaagtca aattatcggt acctttgagc tgcagggtcc 27000 
ctcgcctgac gaaaagtccg cggctccggg gttgaaactc actccggggc tgtggacgtc 27060 
ggcttacctt cgcaaatttg tacctgagga ctaccacgcc cacgagatta ggttctacga 27120 
agaccaatcc cgcccgccaa atgcggagct taccgcctgc gtcattaccc agggccacat 27180 
tcttggccaa ttgcaagcca tcaacaaagc ccgccaagag tttctgctac gaaagggacg 27240 
gggggtttac ttggaccccc agtccggcga ggagctcaac ccaatccccc cgccgccgca 27300 
gccctatcag cagcagccgc gggcccttgc ttcccaggat ggcacccaaa aagaagctgc 27360 
agctgccgcc gccacccacg gacgaggagg aatactggga cagtcaggca gaggaggttt 27420 
tggacgagga ggaggaggac atgatggaag actgggagag cctagacgag gaagcttccg 27480 
aggtcgaaga ggtgtcagac gaaacaccgt caccctcggt cgcattcccc tcgccggcgc 27540 
cccagaaatc ggcaaccggt tccagcatgg ctacaacctc cgctcctcag gcgccgccgg 27600 
cactgcccgt tcgccgaccc aaccgtagat gggacaccac tggaaccagg gccggtaagt 27660 
ccaagcagcc gccgccgtta gcccaagagc aacaacagcg ccaaggctac cgctcatggc 27720 
gcgggcacaa gaacgccata gttgcttgct tgcaagactg tgggggcaac atctcctbcg 27780 
cccgccgctt tcttctctac catcacggcg tggccttccc ccgtaacatc ctgcattact 27840 
accgtcatct ctacagccca tactgcaccg gcggcagcgg cagcggcagc aacagcagcg 27900 
gccacacaga agcaaaggcg accggatagc aagactctga caaagcccaa gaaatccaca 27960 
gcggcggcag cagcaggagg aggagcgctg cgtctggcgc ccaacgaacc cgtatcgacc 28020 
cgcgagctta gaaacaggat ttttcccact ctgtatgcta tatttcaaca gagcaggggc 28080 
caagaacaag agctgaaaat aaaaaacagg tctctgcgat ccctcacccg cagctgcctg 28140 
tatcacaaaa gcgaagatca gcttcggcgc acgctggaag acgcggaggc tctcttcagt 28200 
aaatactgcg cgctgactct taaggactag tttcgcgccc tttctcaaat ttaagcgcga 28260 
ttcacgcctc 
ggcattggaa 
tcgggacctc 
tcggcggacg 
ctggtccact 
gaattgcccg 
gagcttgccc 
aggggaccct 
ttaattaatt 
ttgtgttatg 
tcagtagtat 
acagaaccct 
ctccccggct 
tattccacac 
gcagctcact 
gcggttgctt 
cgtgcatcag 
gctccgtcct 
gcagcataag 
cacagtaact 
atccaaagct 
agattaagtg 
tgtaattcac 
ccatcctaaa 
tggaacaatg 
tatcaatgtt 
cccgcgttag 
tgcagggaag 
gcagcggatg 
ccctactgta 
atggaacgcc 
gatctgcgtc 
tctctcaaag 
gctgccctga 
ttctgcgagt 
ttccaaaaga 
ggtggcgtgg 
aatggcttcc 
agggtgaatc 
tcgccacctt 
aatctgctcc 
tcaggttcct 
tcccgtaggt 
gcggccactt 
ctcggagcta 
aaaatgcaag 
gtagtcatgc 
catttttctc 
aacatttaaa 
cggactacgg 
accgacagct 
tgattcatcg 
gcgtagagac 
cacataaaca 
aacatacagc 
tattaaaaaa 
caagtgcaga 
aacacccaga 
ttcctcaaat 
caattcccaa 
cgccccgcgc 
aaggtatatt 



gtcaggcaat 
ctctgcaatt 
ccggccacta 
gctacgactg 
gtcgccgcca 
aggatcatat 
gtagcctgat 
gtgttctcac 
gccacatcct 
tttcaacgtg 
agccccacca 
agtattcaac 
ggccttaaaa 
ggtttcctgt 
taagttcatg 
aacgggcggc 
gatagggcgg 
gcaggaatac 
gcgccttgtc 
gcagcacagc 
catggcgggg 
gcgacccctc 
cacctcccgg 
ccagctggcc 
acagtggaga 
ggcacaacac 
aaccatatcc 
acctcgcacg 
atcctccagt 
cggagtgcgc 
ggacgtagtc 
tccggtctcg 
catccaggcg 
taacatccac 
cacacacggg 
ttatccaaaa 
tcaaactcta 
aaaaggcaaa 
tcctctataa 
ctcaatatat 
agagcgccct 
cacagacctg 
cccttcgcag 
ccccgccagg 
tgctaaccag 
gtgctgctca 
tcatgcagat 
tcaaacatgt 
cattagaagc 
ccatgccggc 
cctcggtcat 
gtcagtgcta 
aacattacag 
cctgaaaaac 
gcttcacagc 
acaccactcg 
gcgagtatat 
aaaccgcacg 
cgtcacttcc 
cacatacaag 
cacgtcacaa 
attgatgatg 



cctaactctg cagacctcgt cctctgagcc gcgctctgga 28860 
tattgaggag tttgtgccat cggtctactt taaccccttc 28920 
tccggatcaa tttattccta actttgacgc ggtaaaggac 28980 
aatgttaagt ggagaggcag agcaactgcg cctgaaacac 29040 
caagtgcttt gcccgcgact ccggtgagtt ttgctacttt 29100 
cgagggcccg gcgcacggcg tccggcttac cgcccaggga 29160 
tcgggagttt acccagcgcc ccctgctagt tgagcgggac 29220 
tgtgatttgc aactgtccta accttggatt acatcaagat 29280 
cttacacttt ttcatacatt gcccaagaat aaagaatcgt 29340 
tttatttttc aattgcagaa aatttcaagt catttttcat 29400 
ccacatagct tatacagatc accgtacctt aatcaaactc 29460 
ctgccacctc cctcccaaca cacagagtac acagtccttt 29520 
agcatcatat catgggtaac agacatattc ttaggtgtta 29580 
cgagccaaac gctcatcagt gatattaata aactccccgg 29640 
tcgctgtcca gctgctgagc cacaggctgc tgtccaactt 29700 
gaaggagaag tccacgccta catgggggta gagtcataat 29760 
tggtgctgca gcagcgcgcg aataaactgc tgccgccgcc 29820 
alcatggcag tggtctcctc agcgatgatt cgcaccgccc 29880 
ctccgggcac agcagcgcac cctgatctca cttaaatcag 29940 
accacaatat tgttcaaaat cccacagtgc aaggcgctgt 30000 
accacagaac ccacgtggcc atcataccac aagcgcaggt 30060 
ataaacacgc tggacataaa cattacctct tttggcatgt 30120 
taccatataa acctctgatt aaacatggcg ccatccacca 30180 
aaaacctgcc cgccggctat acactgcagg gaaccgggac 30240 
gcccaggact cgtaaccatg gatcatcatg ctcgtcatga 30300 
aggcacacgt gcatacactt cctcaggatt acaagctcct 30360 
cagggaacaa cccattcctg aatcagcgta aatcccacac 30420 
taactcacgt tgtgcattgt caaagtgtta cattcgggca 30480 
atggtagcgc gggtttctgt ctcaaaagga ggtagacgat 30540 
cgalacaacc Ulatcgtgt tggtcgtagt gtcatgccaa 30600 
atatttcctg aagcaaaacc aggtgcgggc gtgacaaaca 30660 
ccgcttagat cgctctgtgt agtagttgta gtatatccac 30720 
ccccctggct tcgggttcta tgtaaactcc ttcatgcgcc 30780 
caccgcagaa taaSccacac ccagccaacc tacacattcg 30840 
Sgagcglga aga^ctggaa gaaccatgtt ttttttttta 30900 
cctcaaaatg aagatctatt aagtgaacgc gctcccctcc 30960 
cagccaaaga acagataatg gcatttgtaa gatgttgcac 31020 
cggccctcac gtccaagtgg acgtaaaggc taaacccttc 31080 
acattccagc accttcaacc atgcccaaat aattctcatc 31140 
ctctaagcaa atcccgaata ttaagtccgg ccattgtaaa 31200 
ccaccttcag cctcaagcag cgaatcatga ttgcaaaaat 31260 
tataagattc aaaagcggaa cattaacaaa aataccgcga 31320 
ggccagctga acataatcgt gcaggtctgc acggaccagc 31380 
aaccttgaca aaagaaccca cactgattat gacacgcata 31440 
cgtagccccg atgtaagctt tgttgcatgg gcggcgatat 31500 
aaaaatcagg caaagcctcg cgcaaaaaag aaagcacatc 31560 
aaaggcaggt aagctccgga accaccacag aaaaagacac 31620 
ctgcgggttt ctgcataaac acaaaataaa ataacaaaaa 31680 
ctitcttaca acaggaaaaa caacccttat aagcataaga 31740 
gtgaccgtaa aaaaactggt caccgtgatt aaaaagcacc 31800 
gtccggagtc ataatgtaag actcggtaaa cacatcaggt 31860 
aaaagcgacc gaaatagccc gggggaatac atacccgcag 31920 
cccccatagg aggtataaca aaattaatag gagagaaaaa 31980 
cctcctgcct aggcaaaata gcaccctccc gctccagaac 32040 
ggcagcctaa cagtcagcct taccagtaaa aaagaaaacc 32100 
acacggcacc agctcaatca gtcacagtgt aaaaaagggc 32160 
ataggactaa aaaatgacgt aacggttaaa gtccacaaaa 32220 
cgaacctacg cccagaaacg aaagccaaaa aacccacaac 32280 
gttttcccac gttacgtaac ttcccatttt aagaaaacta 32340 
ttactccgcc ctaaaaccta cgtcacccgc cccgttccca 32400 
actccacccc ctcattatca tattggcttc aatccaaaat 32460 
<210> 24 

<211> 25 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 24 

ctcaacaatt gtggatccgt actcc 25 

<210> 25 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 25 

gtgctcagca gatcttgcga ctgtg 25 



<210> 26 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 26 

ggcgcgttcg gatccactct cttcc 25 

<210> 27 
<211> 28 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 27 

ctacatgcta ggcagatctc gttcggag 28 



<210> 28 
<211> 1240 
<212> DNA 
<213> adenovirus 



<400> 28 

ggatccactc tcttccgcat cgctgtctgc gagggccagc tgttggggtg agtactccct 60 
ctgaaaagcg ggcatgactt ctgcgctaag attgtcagtt tccaaaaacg aggaggattt 120 
gatattcacc tggcccgcgg tgatgccttt gagggtggcc gcatccatct ggtcagaaaa 180 
gacaatcttt ttgttgtcaa gcttggtggc aaacgacccg tagagggcgt tggacagcaa 240 
cttggcgatg gagcgcaggg tttggttttt gtcgcgatcg gcgcgctcct tggccgcgat 300 
gtttagctgc acgtattcgc gcgcaacgca ccgccattcg ggaaagacgg tggtgcgctc 360 
gtcgggcacc aggtgcacgc gccaaccgcg gttgtgcagg gtgacaaggt caacgctggt 420 
ggctacctct ccgcgtaggc gctcgttggt ccagcagagg cggccgccct tgcgcgagca 480 
gaatggcggt agggggtcta gctgcgtctc gtccgggggg tctgcgtcca cggtaaagac 540 
cccgggcagc aggcgcgcgt cgaagtagtc tatcttgcat ccttgcaagt ctagcgcctg 600 
ctgccatgcg cgggcggcaa gcgcgcgctc gtatgggttg agtgggggac cccatggcat 660 

ggggtgggtg agcgcggagg cgtacatgcc gcaaatgtcg taaacgtaga ggggctctct 720 

gagtattcca agatatgtag ggtagcatct tccaccgcgg atgctggcgc gcacgtaatc 780 

gtatagttcg tgcgagggag cgaggaggtc gggaccgagg ttgctacggg cgggctgctc 840 

tgctcggaag actatctgcc tgaagatggc atgtgagttg gatgatatgg ttggacgctg 900 

gaagacgttg aagctggcgt ctgtgagacc taccgcgtca cgcacgaagg aggcgtagga 960 

gtcgcgcagc ttgttgacca gctcggcggt gacctgcacg tctagggcgc agtagtccag 1020 

ggtttccttg atgatgtcat acttatcctg tccctttttt ttccacagct cgcggttgag 1080 

gacaaactct tcgcggtctt tccagtactc ttggatcgga aacccgtcgg cctccgaacg 1140 

agatccgtac tccgccgccg agggacctga gcgagtccgc atcgaccgga tcggaaaacc 1200 

tctcgagaaa ggcgtctaac cagtcacagt cgcaagatct 1240 

<210> 29 
<211> 8383 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid pDV60 
<400> 29 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gcttggtacc 900 
gagctcggat ccactctctt ccgcatcgct gtctgcgagg gccagctgtt ggggtgagta 960 
ctccctctga aaagcgggca tgacttctgc gctaagattg tcagtttcca aaaacgagga 1020 
ggatttgata ttcacctggc ccgcggtgat gcctttgagg gtggccgcat ccatctggtc 1080 
agaaaagaca atctttttgt tgtcaagctt ggtggcaaac gacccgtaga gggcgttgga 1140 
cagcaacttg gcgatggagc gcagggtttg gtttttgtcg cgatcggcgc gctccttggc 1200 
cgcgatgttt agctgcacgt attcgcgcgc aacgcaccgc cattcgggaa agacggtggt 1260 
gcgctcgtcg ggcaccaggt gcacgcgcca accgcggttg tgcagggtga caaggtcaac 1320 
gctggtggct acctctccgc gtaggcgctc gttggtccag cagaggcggc cgcccttgcg 1380 
cgagcagaat ggcggtaggg ggtctagctg cgtctcgtcc ggggggtctg cgtccacggt 1440 
aaagaccccg ggcagcaggc gcgcgtcgaa gtagtctatc ttgcatcctt gcaagtctag 1500 
cgcctgctgc catgcgcggg cggcaagcgc gcgctcgtat gggttgagtg ggggacccca 1560 
tggcatgggg tgggtgagcg cggaggcgta catgccgcaa atgtcgtaaa cgtagagggg 1620 
ctctctgagt attccaagat atgtagggta gcatcttcca ccgcggatgc tggcgcgcac 1680 
gtaatcgtat agttcgtgcg agggagcgag gaggtcggga ccgaggttgc tacgggcggg 1740 
ctgctctgct cggaagacta tctgcctgaa gatggcatgt gagttggatg atatggttgg 1800 
acgctggaag acgttgaagc tggcgtctgt gagacctacc gcgtcacgca cgaaggaggc 1860 
gtaggagtcg cgcagcttgt tgaccagctc ggcggtgacc tgcacgtcta gggcgcagta 1920 
gtccagggtt tccttgatga tgtcatactt atcctgtccc ttttttttcc acagctcgcg 1980 
gttgaggaca aactcttcgc ggtctttcca gtactcttgg atcggaaacc cgtcggcctc 2040 
cgaacgagat ccgtactccg ccgccgaggg acctgagcga gtccgcatcg accggatcgg 2100 
aaaacctctc gagaaaggcg tctaaccagt cacagtcgca agatccaaga tgaagcgcgc 2160 
aagaccgtct gaagatacct tcaaccccgt gtatccatat gacacggaaa ccggtcctcc 2220 
aactgtgcct tttcttactc ctccctttgt atcccccaat gggtttcaag agagtccccc 2280 
tggggtactc tctttgcgcc tatccgaacc tctagttacc tccaatggca tgcttgcgct 2340 
caaaatgggc aacggcctct ctctggacga ggccggcaac cttacctccc aaaatgtaac 2400 
cactgtgagc ccacctctca aaaaaaccaa gtcaaacata aacctggaaa tatctgcacc 2460 
cctcacagtt acctcagaag ccctaactgt ggctgccgcc gcacctctaa tggtcgcggg 2520 
caacacactc accatgcaat cacaggcccc gctaaccgtg cacgactcca aacttagcat 2580 
tgccacccaa ggacccctca cagtgtcaga aggaaagcta gccctgcaaa catcaggccc 2640 
cctcaccacc accgatagca gtacccttac tatcactgcc tcaccccctc taactactgc 2700 
cactggtagc ttgggcattg acttgaaaga gcccatttat acacaaafcg gaaaactagg 2760 
actaaagtac ggggctcctt tgcatgtaac agacgaccta aacactttga ccgtagcaac 2820 
tggtccaggt gtgactatta ataatacttc cttgcaaact aaagttactg gagccttggg 2880 
ttttgattca caaggcaata tgcaacttaa tgtagcagga ggactaagga ttgattctca 2940 
aaacagacgc cttatacttg atgttagtta tccgtttgat gctcaaaacc aactaaatct 3000 
aagactagga cagggccctc tttttataaa ctcagcccac aacttggata ttaactacaa 3060 
caaaggcctt tacttgttta cagcttcaaa caattccaaa aagcttgagg ttaacctaag 3120 
cactgccaag gggttgatgt ttgacgctac agccatagcc attaatgcag gagatgggct 3180 
tgaatttggt tcacctaatg caccaaacac aaatcccctc aaaacaaaaa ttggccatgg 3240 
cctagaattt gattcaaaca aggctatggt tcctaaacta ggaactggcc ttagttttga 3300 
cagcacaggt gccattacag taggaaacaa aaataatgat aagctaactt tgtggaccac 3360 
accagctcca tctcctaact gtagactaaa tgcagagaaa gatgctaaac tcactttggt 3420 
cttaacaaaa tgtggcagtc aaatacttgc tacagtttca gttttggctg ttaaaggcag 3480 
tttggctcca atatctggaa cagttcaaag tgctcatctt attataagat ttgacgaaaa 3540 
tggagtgcta ctaaacaatt ccttcctgga cccagaatat tggaacttta gaaatggaga 3600 
tcttactgaa ggcacagcct atacaaacgc tgttggattt atgcctaacc tatcagctta 3660 
tccaaaatct cacggtaaaa ctgccaaaag taacattgtc agtcaagttt acttaaacgg 3720 
agacaaaact aaacctgtaa cactaaccat tacactaaac ggtacacagg aaacaggaga 3780 
cacaactcca agtgcatact ctatgtcatt ttcatgggac tggtctggcc acaactacat 3840 
taatgaaata tttgccacat cctcttacac tttttcatac attgcccaag aataaaagaa 3900 
gcggccgctc gagcatgcat ctagagggcc ctattctata gtgtcaccta aatgctagag 3960 
ctcgctgatc agcctcgact gtgccttcta gttgccagcc atctgttgtt tgcccctccc 4020 
ccgtgccttc cttgaccctg gaaggtgcca ctcccactgt cctttcctaa taaaatgagg 4080 
aaattgcatc gcattgtctg agtaggtgtc attctattct ggggggtggg gtggggcagg 4140 
acagcaaggg ggaggattgg gaagacaata gcaggcatgc tggggatgcg gtgggctcta 4200 
tggcttctga ggcggaaaga accagctggg gctctagggg gtatccccac gcgccctgta 4260 
gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca 4320 
gcgccctagc gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct 4380 
ttccccgtca agctctaaat cggggcatcc ctttagggtt ccgatttagt gctttacggc 4440 
acctcgaccc caaaaaactt gattagggtg atggttcacg tagtgggcca tcgccctgat 4500 
agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc 4560 
aaactggaac aacactcaac cctatctcgg tctattcttt tgatttataa gggattttgg 4620 
ggatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaattaat 4680 
tctgtggaat gtgtgtcagt tagggtgtgg aaagtcccca ggctccccag gcaggcagaa 4740 
gtatgcaaag catgcatctc aattagtcag caaccaggtg tggaaagtcc ccaggctccc 4800 
cagcaggcag aagtatgcaa agcatgcatc tcaattagtc agcaaccata gtcccgcccc 4860 
taactccgcc catcccgccc ctaactccgc ccagttccgc ccattctccg ccccatggct 4920 
gactaatttt ttttatttat gcagaggccg aggccgcctc tgcctctgag ctattccaga 4980 
agtagtgagg aggctttttt ggaggcctag gcttttgcaa aaagctcccg ggagcttgta 5040 
tatccatttt cggatctgat caagagacag gatgaggatc gtttcgcatg attgaacaag 5100 
atggattgca cgcaggttct ccggccgctt gggtggagag gctattcggc tatgactggg 5160 
cacaacagac aatcggctgc tctgatgccg ccgtgttccg gctgtcagcg caggggcgcc 5220 
cggttctttt tgtcaagacc gacctgtccg gtgccctgaa tgaactgcag gacgaggcag 5280 
cgcggctatc gtggctggcc acgacgggcg ttccttgcgc agctgtgctc gacgttgtca 5340 
ctgaagcggg aagggactgg ctgctattgg gcgaagtgcc ggggcaggat ctcctgtcat 5400 
ctcaccttgc tcctgccgag aaagtatcca tcatggctga tgcaatgcgg cggctgcata 5460 
cgcttgatcc ggctacctgc ccattcgacc accaagcgaa acatcgcatc gagcgagcac 5520 
gtactcggat ggaagccggt cttgtcgatc aggatgatct ggacgaagag catcaggggc 5580 
tcgcgccagc cgaactgttc gccaggctca aggcgcgcat gcccgacggc gaggatctcg 5640 
tcgtgaccca tggcgatgcc tgcttgccga atatcatggt ggaaaatggc cgcttttctg 5700 
gattcatcga ctgtggccgg ctgggtgtgg cggaccgcta tcaggacata gcgttggcta 5760 
cccgtgatat tgctgaagag cttggcggcg aatgggctga ccgcttcctc gtgctttacg 5820 
gtatcgccgc tcccgattcg cagcgcatcg ccttctatcg ccttcttgac gagttcttct 5880 
gagcgggact ctggggttcg aaatgaccga ccaagcgacg cccaacctgc catcacgaga 5940 
tttcgattcc accgccgcct tctatgaaag gttgggcttc ggaatcgttt tccgggacgc 6000 
cggctggatg atcctccagc gcggggatct catgctggag ttcttcgccc accccaactt 6060 
gtttattgca gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa 6120 
agcatttttt tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca 6180 
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tgtctgtata ccgtcgacct ctagctagag cttggcgtaa tcatggtcat agctgtttcc 6240 
tgtgtgaaat tgttatccgc tcacaattcc acacaacata cgagccggaa gcataaagtg 6300
taaagcctgg ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc 6360
cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg 6420
gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 6480
ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 6540
agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 6600
ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 6660
caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 6720
gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 6780
cctgtccgcc tttctccctt cgggaagcgt ggcgctttct caatgctcac gctgtaggta 6840
tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 6900
gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 6960
cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 7020
tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 7080
tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 7140
caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 7200
aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 7260
cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 7320
ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 7380
tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 7440
atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 7500
tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 7560
aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 7620
catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 7680
gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 7740
ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 7800
aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 7860
atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 7920
cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 7980
gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa 8040

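agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 8100
gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 8160
caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 8220
ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 8280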

tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 8340
aggggttccg cgcacatttc cccgaaaagt gccacctgac gtc 

<210> 30 
<211> 7960 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: plasmid pDV67 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 900
gtttaaactt aagcttggta ccgagctcgg atccactctc ttccgcatcg ctgtctgcga 960
gggccagctg ttggggtgag tactccctct gaaaagcggg catgacttct gcgctaagat 1020
tgtcagtttc caaaaacgag gaggatttga tattcacctg gcccgcggtg atgcctttga 1080
gggtggccgc atccatctgg tcagaaaaga caatcttttt gttgtcaagc ttggtggcaa 1140
acgacccgta gagggcgttg gacagcaact tggcgatgga gcgcagggtt tggtttttgt 1200
cgcgatcggc gcgctccttg gccgcgatgt ttagctgcac gtattcgcgc gcaacgcacc 1260
gccattcggg aaagacggtg gtgcgctcgt cgggcaccag gtgcacgcgc caaccgcggt 1320
tgtgcagggt gacaaggtca acgctggtgg ctacctctcc gcgtaggcgc tcgttggtcc 1380
agcagaggcg gccgcccttg cgcgagcaga atggcggtag ggggtctagc tgcgtctcgt 1440
ccggggggtc tgcgtccacg gtaaagaccc cgggcagcag gcgcgcgtcg aagtagtcta 1500
tcttgcatcc ttgcaagtct agcgcctgct gccatgcgcg ggcggcaagc gcgcgctcgt 1560
atgggttgag tgggggaccc catggcatgg ggtgggtgag cgcggaggcg tacatgccgc 1620
aaatgtcgta aacgtagagg ggctctctga gtattccaag atatgtaggg tagcatcttc 1680
caccgcggat gctggcgcgc acgtaatcgt atagttcgtg cgagggagcg aggaggtcgg 1740
gaccgaggtt gctacgggcg ggctgctctg ctcggaagac tatctgcctg aagatggcat 1800
gtgagttgga tgatatggtt ggacgctgga agacgttgaa gctggcgtct gtgagaccta 1860
ccgcgtcacg cacgaaggag gcgtaggagt cgcgcagctt gttgaccagc tcggcggtga 1920
cctgcacgtc tagggcgcag tagtccaggg tttccttgat gatgtcatac ttatcctgtc 1980
cctttttttt ccacagctcg cggttgagga caaactcttc gcggtctttc cagtactctt 2040
ggatcggaaa cccgtcggcc tccgaacgag atccgtactc cgccgccgag ggacctgagc 2100
gagtccgcat cgaccggatc ggaaaacctc tcgagaaagg cgtctaacca gtcacagtcg 2160
caagatccaa gatgaagcgc gcaagaccgt ctgaagatac cttcaacccc gtgtatccat 2220
atgacacgga aaccggtcct ccaactgtgc cttttcttac tcctcccttt gtatccccca 2280
atgggtttca agagagtccc cctggggtac tctctttgcg cctatccgaa cctctagtta 2340
cctccaatgg catgcttgcg ctcaaaatgg gcaacggcct ctctctggac gaggccggca 2400
accttacctc ccaaaatgta accactgtga gcccacctct caaaaaaacc aagtcaaaca 2460
taaacctgga aatatctgca cccctcacag ttacctcaga agccctaact gtggctgccg 2520
ccgcacctct aatggtcgcg ggcaacacac tcaccatgca atcacaggcc ccgctaaccg 2580
tgcacgactc caaacttagc attgccaccc aaggacccct cacagtgtca gaaggaaagc 2640
tagccctgca aacatcaggc cccctcacca ccaccgatag cagtaccctt actatcactg 2700
cctcaccccc tctaactact gccactggta gcttgggcat tgacttgaaa gagcccattt 2760
atacacaaaa tggaaaacta ggactaaagt acggggctcc tttgcatgta acagacgacc 2820
taaacacttt gaccgtagca actggtccag gtgtgactat taataatact tccttgcaaa 2880
ctaaagttac tggagccttg ggttttgatt cacaaggcaa tatgcaactt aatgtagcag 2940
gaggactaag gattgattct caaaacagac gccttatact tgatgttagt tatccgtttg 3000
atgctcaaaa ccaactaaat ctaagactag gacagggccc tctttttata aactcagccc 3060
acaacttgga tattaactac aacaaaggcc tttacttgtt tacagcttca aacaattcca 3120
aaaagcttga ggttaaccta agcactgcca aggggttgat gtttgacgct acagccatag 3180
ccattaatgc aggagatggg cttgaatttg gttcacctaa tgcaccaaac acaaatcccc 3240
tcaaaacaaa aattggccat ggcctagaat ttgattcaaa caaggctatg gttcctaaac 3300
taggaactgg ccttagtttt gacagcacag gtgccattac agtaggaaac aaaaataatg 3360
ataagctaac tttgtggacc acaccagctc catctcctaa ctgtagacta aatgcagaga 3420
aagatgctaa actcactttg gtcttaacaa aatgtggcag tcaaatactt gctacagttt 3480
cagttttggc tgttaaaggc agtttggctc caatatctgg aacagttcaa agtgctcatc 3540
ttattataag atttgacgaa aatggagtgc tactaaacaa ttccttcctg gacccagaat 3600
attggaactt tagaaatgga gatcttactg aaggcacagc ctatacaaac gctgttggat 3660
ttatgcctaa cctatcagct tatccaaaat ctcacggtaa aactgccaaa agtaacattg 3720
tcagtcaagt ttacttaaac ggagacaaaa ctaaacctgt aacactaacc attacactaa 3780
acggtacaca ggaaacagga gacacaactc caagtgcata ctctatgtca ttttcatggg 3840
actggtctgg ccacaactac attaatgaaa tatttgccac atcctcttac actttttcat 3900
acattgccca agaataaaag aagcggccgc tcgagtctag agggcccgtt taaacccgct 3960
gatcagcctc gactgtgcct tctagttgcc agccatctgt tgtttgcccc tcccccgtgc 4020
cttccttgac cctggaaggt gccactccca ctgtcctttc ctaataaaat gaggaaattg 4080
catcgcattg tctgagtagg tgtcattcta ttctgggggg tggggtgggg caggacagca 4140
agggggagga ttgggaagac aatagcaggc atgctgggga tgcggtgggc tctatggctt 4200
ctgaggcgga aagaaccagc tggggctcta gggggtatcc ccacgcgccc tgtagcggcg 4260
cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc 4320
tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc ggctttcccc 4380
gtcaagctct aaatcggggc atccctttag ggttccgatt tagtgcttta cggcacctcg 4440
accccaaaaa acttgattag ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg 4500
tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaactg 4560
gaacaacact caaccctatc tcggtctatt cttttgattt ataagggatt ttggggattt 4620
cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat taattctgtg 4680
gaatgtgtgt cagttagggt gtggaaagtc cccaggctcc ccaggcaggc agaagtatgc 4740
aaagcatgca tctcaattag tcagcaacca ggtgtggaaa gtccccaggc tccccagcag 4800
gcagaagtat gcaaagcatg catctcaatt agtcagcaac catagtcccg cccctaactc 4860
cgcccatccc gcccctaact ccgcccagtt ccgcccattc tccgccccat ggctgactaa 4920
ttttttttat ttatgcagag gccgaggccg cctctgcctc tgagctattc cagaagtagt 4980
gaggaggctt ttttggaggc ctaggctttt gcaaaaagct cccgggagct tgtatatcca 5040
ttttcggatc tgatcagcac gtgttgacaa ttaatcatcg gcatagtata tcggcatagt 5100
ataatacgac aaggtgagga actaaaccat ggccaagttg accagtgccg ttccggtgct 5160
caccgcgcgc gacgtcgccg gagcggtcga gttctggacc gaccggctcg ggttctcccg 5220
ggacttcgtg gaggacgact tcgccggtgt ggtccgggac gacgtgaccc tgttcatcag 5280
cgcggtccag gaccaggtgg tgccggacaa caccctggcc tgggtgtggg tgcgcggcct 5340
ggacgagctg tacgccgagt ggtcggaggt cgtgtccacg aacttccggg acgcctccgg 5400
gccggccatg accgagatcg gcgagcagcc gtgggggcgg gagttcgccc tgcgcgaccc 5460
ggccggcaac tgcgtgcact tcgtggccga ggagcaggac tgacacgtgc tacgagattt 5520
cgattccacc gccgccttct atgaaaggtt gggcttcgga atcgttttcc gggacgccgg 5580
ctggatgatc ctccagcgcg gggatctcat gctggagttc ttcgcccacc ccaacttgtt 5640
tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 5700
atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 5760
ctgtataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc tgtttcctgt 5820
gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca taaagtgtaa 5880
agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct cactgcccgc 5940
tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac gcgcggggag 6000
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 6060
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 6120
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 6180
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 6240
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 6300
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 6360
gtccgccttt ctcccttcgg gaagcgtggc gctttctcaa tgctcacgct gtaggtatct 6420
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 6480
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 6540
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 6600
tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat 6660
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 6720
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 6780
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 6840
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 6900
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 6960
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 7020
catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 7080
ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 7140
aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 7200
ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 7260
caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 7320
attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 7380
agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 7440
actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 7500
ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 7560
ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 7620
gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 7680
atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 7740
cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 7800
gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 7860
gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 7920
ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc 7960

<210> 31
<211> 30
<212> DNA

<213> Artificial Sequence
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<220> 

<223> Description of Artificial Sequence: primer 
<400> 31 

atgggatcca agatgaagcg cgcaagaccg 30 

<210> 32 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 32 

cactatagcg gccgcattct cagtcatctt 30 

<210> 33 
<211> 7989 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid pDV69 
<400> 33 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 900 
gtttaaactt aagcttggta ccgagctcgg atccactctc ttccgcatcg ctgtctgcga 960 
gggccagctg ttggggtgag tactccctct gaaaagcggg catgacttct gcgctaagat 1020 
tgtcagtttc caaaaacgag gaggatttga tattcacctg gcccgcggtg atgcctttga 1080 
gggtggccgc atccatctgg tcagaaaaga caatcttttt gttgtcaagc ttggtggcaa 1140 
acgacccgta gagggcgttg gacagcaact tggcgatgga gcgcagggtt tggtttttgt 1200 
cgcgatcggc gcgctccttg gccgcgatgt ttagctgcac gtattcgcgc gcaacgcacc 1260 
gccattcggg aaagacggtg gtgcgctcgt cgggcaccag gtgcacgcgc caaccgcggt 1320 
tgtgcagggt gacaaggtca acgctggtgg ctacctctcc gcgtaggcgc tcgttggtcc 1380 
agcagaggcg gccgcccttg cgcgagcaga atggcggtag ggggtctagc tgcgtctcgt 1440 
ccggggggtc tgcgtccacg gtaaagaccc cgggcagcag gcgcgcgtcg aagtagtcta 1500 
tcttgcatcc ttgcaagtct agcgcctgct gccatgcgcg ggcggcaagc gcgcgctcgt 1560 
atgggttgag tgggggaccc catggcatgg ggtgggtgag cgcggaggcg tacatgccgc 1620 
aaatgtcgta aacgtagagg ggctctctga gtattccaag atatgtaggg tagcatcttc 1680 
caccgcggat gctggcgcgc acgtaatcgt atagttcgtg cgagggagcg aggaggtcgg 1740 
gaccgaggtt gctacgggcg ggctgctctg ctcggaagac tatctgcctg aagatggcat 1800 
gtgagttgga tgatatggtt ggacgctgga agacgttgaa gctggcgtct gtgagaccta 1860 
ccgcgtcacg cacgaaggag gcgtaggagt cgcgcagctt gttgaccagc tcggcggtga 1920 
cctgcacgtc tagggcgcag tagtccaggg tttccttgat gatgtcatac ttatcctgtc 1980 
cctttttttt ccacagctcg cggttgagga caaactcttc gcggtctttc cagtactctt 2040 
ggatcggaaa cccgtcggcc tccgaacgag atccgtactc cgccgccgag ggacctgagc 2100 
gagtccgcat cgaccggatc ggaaaacctc tcgagaaagg cgtctaacca gtcacagtcg 2160 



caagatccaa gatgaagcgc gcaagaccgt ctgaagatac cttcaacccc gtgtatccat 2220
atgacacgga aaccggtcct ccaactgtgc cttttcttac tcctcccttt gtatccccca 2280
atgggtttca agagagtccc cctggggtac tctctttgcg cctatccgaa cctctagtta 2340
cctccaatgg catgcttgcg ctcaaaatgg gcaacggcct ctctctggac gaggccggca 2400
accttacctc ccaaaatgta accactgtga gcccacctct caaaaaaacc aagtcaaaca 2460
taaacctgga aatatctgca cccctcacag ttacctcaga agccctaact gtggctgccg 2520
ccgcacctct aatggtcgcg ggcaacacac tcaccatgca atcacaggcc ccgctaaccg 2580
tgcacgactc caaacttagc attgccaccc aaggacccct cacagtgtca gaaggaaagc 2640
tagccctgca aacatcaggc cccctcacca ccaccgatag cagtaccctt actatcactg 2700
cctcaccccc tctaactact gccactggta gcttgggcat tgacttgaaa gagcccattt 2760
atacacaaaa tggaaaacta ggactaaagt acggggctcc tttgcatgta acagacgacc 2820
taaacacttt gaccgtagca actggtccag gtgtgactat taataatact tccttgcaaa 2880
ctaaagttac tggagccttg ggttttgatt cacaaggcaa tatgcaactt aatgtagcag 2940
gaggactaag gattgattct caaaacagac gccttatact tgatgttagt tatccgtttg 3000
atgctcaaaa ccaactaaat ctaagactag gacagggccc tctttttata aactcagccc 3060
acaacttgga tattaactac aacaaaggcc tttacttgtt tacagcttca aacaattcca 3120
aaaagcttga ggttaaccta agcactgcca aggggttgat gtttgacgct acagccatag 3180
ccattaatgc aggagatggg cttgaatttg gttcacctaa tgcaccaaac acaaatcccc 3240
tcaaaacaaa aattggccat ggcctagaat ttgattcaaa caaggctatg gttcctaaac 3300
taggaactgg ccttagtttt gacagcacag gtgccattac agtaggaaac aaaaataatg 3360
ataagctaac tttgtggacc ggtccaaaac cagaagccaa ctgcataatt gaatacggga 3420
aacaSaaccc agatagcaaa ctaactttaa tccttgtaaa aaatggagga attgttaatg 3480
gatatgtaac gctaatggga gcctcagact acgttaacac ^ttatttaaa aacaaaaatg 3540
Ectccattaa tgtagaacta tactttgatg ccactggtca tatattacca gactcatctt 3600
ctcttaaaac agatctagaa ctaaaataca agcaaaccgc tgactttagt gcaagaggtt 3660
ttatgccaag tactacagcg tatccatttg tccttcctaa tgcgggaaca cataatgaaa 3720
attatatttt tggtcaatgc tactacaaag caagcgatgg tgcccttttt ccgttggaag 3780
ttactgttat gcttaataaa cgcctgccag atagtcgcac atcctatgtt atgacttttt 3840
tatqgtcctt gaatgctggt ctagctccag aaactactca ggcaaccctc ataacctccc 3900
catttacctt ttcctatatt agagaagatg actgattttt aagaagcggc cgctcgagtc 3960
tagagggccc gtttaaaccc gctgatcagc ctcgactgtg ccttctagtt gccagccatc 4020
tgttgtttgc ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct 4080
ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg 4140
gggtggggtg gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg 4200
ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc snccntagct ggggctctag 4260
ggggtatccc cacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 4320
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 4380
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcggggca tccctttagg 4440
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 4500
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 4560
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 4620
ttttgattta taagggattt tggggatttc ggcctattgg ttaaaaaatg agctgattta 4680
acaaaaattt aacgcgaatt aattctgtgg aatgtgtgtc agttagggtg tggaaagtcc 4740
ccaggctccc caggcaggca gaagtatgca aagcatgcat ctcaattagt cagcaaccag 4800
gtgtggaaag tccccaggct ccccagcagg cagaagtatg caaagcatgc atctcaatta 4860
gtcagcaacc atagtcccgc ccctaactcc gcccatcccg cccctaactc cgcccagttc 4920
cgcccattct ccgccccatg gctgactaat tttttttatt tatgcagagg ccgaggccgc 4980
ctctgcctct gagctattcc agaagtagtg aggaggcttt tttggaggcc taggcttttg 5040
caaaaagctc ccgggagctt gtatatccat tttcggatct gatcagcacg tgttgacaat 5100
taatcatcgg catagtatat cggcatagta taatacgaca aggtgaggaa ctaaaccatg 5160
gccaagttga ccagtgccgt tccggtgctc accgcgcgcg acgtcgccgg agcggtcgag 5220
ttctggaccg accggctcgg gttctcccgg gacttcgtgg aggacgactt cgccggtgtg 5280
gtccgggacg acgtgaccct gttcatcagc gcggtccagg accaggtggt gccggacaac 5340
accctggcct gggtgtgggt gcgcggcctg gacgagctgt acgccgagtg gtcggaggtc 5400
gtgtccacga acttccggga cgcctccggg ccggccatga ccgagatcgg cgagcagccg 5460
tgggggcggg agttcgccct gcgcgacccg gccggcaact gcgtgcactt cgtggccgag 5520
gagcaggact gacacgtgct acgagatttc gattccaccg ccgccttcta tgaaaggttg 5580
ggcttcggaa tcgttttccg ggacgccggc tggatgatcc tccagcgcgg ggatctcatg 5640
ctggagttct tcgcccaccc caacttgttt attgcagctt ataatggtta caaataaagc 5700
aatagcatca caaatttcac aaataaagca tttttttcac tgcattctag ttgtggtttg 5760
tccaaactca tcaatgtatc ttatcatgtc tgtataccgt cgacctctag ctagagcttg 5820
gcgtaatcat ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac 5880
aacatacgag ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc 5940
acattaattg cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg 6000
cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct 6060
tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac 6120
tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga 6180
gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat 6240
aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac 6300
ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct 6360
gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg 6420
ctttctcaat gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg 6480
ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt 6540
cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg 6600
attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac 6660
ggctacacta gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga 6720
aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt 6780
gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt 6840
tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga 6900
ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc 6960
taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct 7020
atctcagcga tctgtctatt tcgttcatcc atagttgcct gactccccgt cgtgtagata 7080
actacgatac gggagggctt accatctggc cccagtgctg caatgatacc gcgagaccca 7140
cgctcaccgg ctccagattt atcagcaata aaccagccag ccggaagggc cgagcgcaga 7200
agtggtcctg caactttatc cgcctccatc cagtctatta attgttgccg ggaagctaga 7260
gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg ccattgctac aggcatcgtg 7320
gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg gttcccaacg atcaaggcga 7380
gttacatgat cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt 7440
gtcagaagta agttggccgc agtgttatca ctcatggtta tggcagcact gcataattct 7500
cttactgtca tgccatccgt aagatgcttt tctgtgactg gtgagtactc aaccaagtca 7560
ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaat acgggataat 7620
accgcgccac atagcagaac tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga 7680
aaactctcaa ggatcttacc gctgttgaga tccagttcga tgtaacccac tcgtgcaccc 7740
aactgatctt cagcatcttt tactttcacc agcgtttctg ggtgagcaaa aacaggaagg 7800
caaaatgccg caaaaaaggg aataagggcg acacggaaat gttgaatact catactcttc 7860
ctttttcaat attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt 7920
gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg aaaagtgcca 7980
cctgacgtc 7989



<210> 34 
<211> 7607 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: plasmid GRE5-E1-SV40-Hygro



<400> 34 

tctagaagat ccgctgtaca ggatgttcta gctactttat tagatccgct gtacaggatg 60
ttctagctac tttattagat ccgctgtaca ggatgttcta gctactttat tagatccgct 120
gtacaggatg ttctagctac tttattagat ccgtgtacag gatgttctag ctactttatt 180
agatcgatct cctggccgtt cggggtcaaa aaccaggttt ggctataaaa gggggtgggg 240
gcgcgttcgt cctcactctc ttccgcatcg ctgtctgcga gggccaggat cgatcctgag 300
aacttcaggg tgagtttggg gacccttgat tgttctttct ttttcgctat tgtaaaattc 360
atgttatatg gagggggcaa agttttcagg gtgttgttta gaatgggaag atgtcccttg 420
tatcaccatg gaccctcatg ataattttgt ttctttcact ttctactctg ttgacaacca 480
ttgtctcctc ttattttctt ttcattttct gtaacttttt cgttaaactt tagcttgcat 540
ttgtaacgaa tttttaaatt cacttttgtt tatttgtcag attgtaagta ctttctctaa 600
tcactttttt ttcaaggcaa tcagggtata ttatattgta cttcagcaca gttttagaga 660
acaattgtta taattaaatg ataaggtaga atatttctgc atataaattc tggctggcgt 720
ggaaatattc ttattggtag aaacaactac atcctggtca tcatcctgcc tttctcttta 780
tggttacaat gatatacact gtttgagatg aggataaaat actctgagtc caaaccgggc 840
ccctctgcta accatgttca tgccttcttc tttttcctac agctcctggg caacgtgctg 900
gttattgtgc tgtctcatca ttttggcaaa gaattagatc taagcttctg cagctcgagg 960



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actcggtcga ctgaaaatga gacatattat ctgccacgga ggtgttatta ccgaagaaat 1020 
ggccgccagt cttttggacc agctgatcga agaggtactg gctgataatc ttccacctcc 1080 
tagccatttt gaaccaccta cccttcacga actgtatgat ttagacgtga cggcccccga 1140 
agatcccaac gaggaggcgg tttcgcagat ttttcccgac tctgtaatgt tggcggtgca 1200 
ggaagggatt gacttactca cttttccgcc ggcgcccggt tctccggagc cgcctcacct 1260 
ttcccggcag cccgagcagc cggagcagag agccttgggt ccggtttcta tgccaaacct 1320 
tgtaccggag gtgatcgatc ttacctgcca cgaggctggc tttccaccca gtgacgacga 1380 
ggatgaagag ggtgaggagt ttgtgttaga ttatgtggag caccccgggc acggttgcag 1440 
gtcttgtcat tatcaccgga ggaatacggg ggacccagat attatgtgtt cgctttgcta 1500 
tatgaggacc tgtggcatgt ttgtctacag taagtgaaaa ttatgggcag tgggtgatag 1560 
agtggtgggt ttggtgtggt aatttttttt ttaattttta cagttttgtg gtttaaagaa 1620 
ttttgtattg tgattttttt aaaaggtcct gtgtctgaac ctgagcctga gcccgagcca 1680 
gaaccggagc ctgcaagacc tacccgccgt cctaaaatgg cgcctgctat cctgagacgc 1740 
ccgacatcac ctgtgtctag agaatgcaat agtagtacgg atagctgtga ctccggtcct 1800 
tctaacacac ctcctgagat acacccggtg gtcccgctgt gccccattaa accagttgcc 1860 
gtgagagttg gtgggcgtcg ccaggctgtg gaatgtatcg aggacttgct taacgagcct 1920 
gggcaacctt tggacttgag ctgtaaacgc cccaggccat aaggtgtaaa cctgtgattg 1980 
cgtgtgtggt taacgccttt gtttgctgaa tgagttgatg taagtttaat aaagggtgag 2040 
ataatgttta acttgcatgg cgtgttaaat ggggcggggc ttaaagggta tataatgcgc 2100 
cgtgggctaa tcttggttac atctgacctc atggaggctt gggagtgttt ggaagatttt 2160 
tctgctgtgc gtaacttgct ggaacagagc tctaacagta cctcttggtt ttggaggttt 2220 
ctgtggggct catcccaggc aaagttagtc tgcagaatta aggaggatta caagtgggaa 2280 
tttgaagagc ttttgaaatc ctgtggtgag ctgtttgatt ctttgaatct gggtcaccag 2340 
gcgcttttcc aagagaaggt catcaagact ttggattttt ccacaccggg gcgcgctgcg 2400 
gctgctgttg cttttttgag ttttataaag gataaatgga gcgaagaaac ccatctgagc 2460 
ggggggtacc tgctggattt tctggccatg catctgtgga gagcggttgt gagacacaag 2520 
aatcgcctgc tactgttgtc ttccgtccgc ccggcgataa taccgacgga ggagcagcag 2580 
cagcagcagg aggaagccag gcggcggcgg caggagcaga gcccatggaa cccgagagcc 2640 
ggcctggacc ctcgggaatg aatgttgtac aggtggctga actgtatcca gaactgagac 2700 
gcattttgac aattacagag gatgggcagg ggctaaaggg ggtaaagagg gagcgggggg 2760 
cttgtgaggc tacagaggag gctaggaatc tagcttttag cttaatgacc agacaccgtc 2820
ctgagtgtat tacttttcaa cagatcaagg ataattgcgc taatgagctt gatctgctgg 2880 
cgcagaagta ttccatagag cagctgacca cttactggct gcagccaggg gatgattttg 2940 
aggaggctat tagggtatat gcaaaggtgg cacttaggcc agattgcaag tacaagatca 3000 
gcaaacttgt aaatatcagg aattgttgct acatttctgg gaacggggcc gaggtggaga 3060 
tagatacgga ggatagggtg gcctttagat gtagcatgat aaatatgtgg ccgggggtgc 3120 
ttggcatgga cggggtggtt attatgaatg taaggtttac tggccccaat tttagcggta 3180 
cggttttcct ggccaatacc aaccttatcc tacacggtgt aagcttctat gggtttaaca 3240 
atacctgtgt ggaagcctgg accgatgtaa gggttcgggg ctgtgccttt tactgctgct 3300 
ggaagggggt ggtgtgtcgc cccaaaagca gggcttcaat taagaaatgc ctctttgaaa 3360 
ggtgtacctt gggtatcctg tctgagggta actccagggt gcgccacaat gtggcctccg 3420 
actgtggttg cttcatgcta gtgaaaagcg tggctgtgat taagcataac atggtatgtg 3480 
gcaactgcga ggacagggcc tctcagatgc tgacctgctc ggacggcaac tgtcacctgc 3540 
tgaagaccat tcacgtagcc agccactctc gcaaggcctg gccagtgttt gagcataaca 3600 
tactgacccg ctgttccttg catttgggta acaggagggg ggtgttccta ccttaccaat 3660 
gcaatttgag tcacactaag atattgcttg agcccgagag catgtccaag gtgaacctga 3720 
acggggtgtt tgacatgacc atgaagatct ggaaggtgct gaggtacgat gagacccgca 3780 
ccaggtgcag accctgcgag tgtggcggta aacatattag gaaccagcct gtgatgctgg 3840 
atgtgaccga ggagctgagg cccgatcact tggtgctggc ctgcacccgc gctgagtttg 3900 
gctctagcga tgaagataca gattgaggta ctgaaatgtg tgggcgtggc ttaagggtgg 3960 
gaaagaatat ataaggtggg ggtcttatgt agttttgtat ctgttttgca gcagccgccg 4020 
ccgccatgag caccaactcg tttgatggaa gcattgtgag ctcatatttg acaacgcgca 4080 
tgcccccatg ggccggggtg cgtcagaatg tgatgggctc cagcattgat ggtcgccccg 4140 
tcctgcccgc aaactctact accttgacct acgagaccgt gtctggaacg ccgttggaga 4200 
ctgcagcctc cgccgccgct tcagccgctg cagccaccgc ccgcgggatt gtgactgact 4260 
ttgctttcct gagcccgctt gcaagcagtg cagcttcccg ttcatccgcc cgcgatgaca 4320 
agttgacggc tcttttggca caattggatt ctttgacccg ggaacttaat gtcgtttctc 4380 
agcagctgtt ggatctgcgc cagcaggttt ctgccctgaa ggcttcctcc cctcccaatg 4440 
cggtttaaaa cataaataaa aaaccagact ctgtttggat ttggatcaag caagtgtctt 4500 
gctgtctcag ctgactgctt aagtcgcaag ccgaattgga tccaattcgg atcgatctta 4560 
ttaaagcaga acttgtttat tgcagcttat aatggttaca aataaagcaa tagcatcaca 4620 
aatttcacaa ataaagcatt tttttcactg cattctagtt gtggtttgtc caaactcatc 4680 
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aatgtatctt 
ctgcgctcgg 
ttatccacag 
gccaggaacc 
gagcatcaca 
taccaggcgt 
accggatacc 
tgtaggtatc 
cccgttcagc 
agacacgact 
gtaggcggtg 
gtatttggta 
tgatccggca 
acgcgcagaa 
cagtggaacg 
acctagatcc 
acttggtctg 
tttcgttcat 
ttaccatctg 
ttatcagcaa 
tccgcctcca 
aatagtttgc 
ggtatggctt 
ttgtgcaaaa 
gcagtgttat 
gtaagatgct 
cggcgaccga 
actttaaaag 
ccgctgttga 
tttactttca 
ggaataaggg 
agcatttatc 
aaacaaatag 
attattatca 
gcgtttcggt 
ttgtctgtaa 
cgggtgtcgg 
tatgcggtgt 
agcgttaata 
caataggccg 
agtgttgttc 
gggcgaaaaa 
tttttggggt 
agagcttgac 
gcgggcgcta 
gcgcttaatg 
gaagggcgat 
gcaaggcgat 
gccagtgaat 



atcatgtctg 
tcgttcggct 
aatcagggga 
gtaaaaaggc 
aaaatcgacg 
ttccccctgg 
"tgtccgcctt 
tcagttcggt 
ccgaccgctg 
tatcgccact 
ctacagagtt 
tctgcgctct 
aacaaaccac 
aaaaaggatc 
aaaactcacg 
ttttaaatta 
acagttacca 
ccatagttgc 
gccccagtgc 
taaaccagcc 
tccagtctat 
gcaacgttgt 
cattcagctc 
aagcggttag 
cactcatggt 
tttctgtgac 
gttgctcttg 
tgctcatcat 
gatccagttc 
ccagcgtttc 
cgacacggaa 
agggttattg 
gggttccgcg 
tgacattaac 
gatgacggtg 
gcggatgccg 
ggctggctta 
gaaataccgc 
ttttgttaaa 
aaatcggcaa 
cagtttggaa 
ccgtctatca 
cgaggtgccg 
ggggaaagcc 
gggcgctggc 
cgccgctaca 
cggtgcgggc 
taagttgggt 
tgtaatacga 



gtcgactcta 
gcggcgagcg 
taacgcagga 
cgcgttgctg 
ctcaagtcag 
aagctccctc 
tctcccttcg 
gtaggtcgtt 
cgccttatcc 
ggcagcagcc 
cttgaagtgg 
gctgaagcca 
cgctggtagc 
tcaagaagat 
ttaagggatt 
aaaatgaagt 
atgcttaatc 
ctgactcccc 
tgcaatgata 
agccggaagg 
taattgttgc 
tgccattgct 
cggttcccaa 
ctccttcggt 
tatggcagca 
tggtgagtac 
cccggcgtca 
tggaaaacgt 
gatgtaaccc 
tgggtgagca 
atgttgaata 
tctcatgagc 
cacatttccc 
ctataaaaat 
aaaacctctg 
ggagcagaca 
actatgcggc 
acagatgcgt 
attcgcgtta 
aatcccttat 
caagagtcca 
gggcgatggc 
taaagcacta 
ggcgaacgtg 
aagtgtagcg 
gggcgcgtcc 
ctcttcgcta 
aacgccaggg 
ctcactatag 



gactcttccg 
gtatcagctc 
aagaacatgt 
gcgtttttcc 
aggtggcgaa 
gtgcgctctc 
ggaagcgtgg 
cgctccaagc 
ggtaactatc 
actggtaaca 
tggcctaact 
gttaccttcg 
ggtggttttt 
cctttgatct 
ttggtcatga 
tttaaatcaa 
agtgaggcac 
gtcgtgtaga 
ccgcgagacc 
gccgagcgca 
cgggaagcta 
acaggcatcg 
cgatcaaggc 
cctccgatcg 
ctgcataatt 
tcaaccaagt 
atacgggata 
tcttcggggc 
actcgtgcac 
aaaacaggaa 
ctcatactct 
ggatacatat 
cgaaaagtgc 
aggcgtatca 
acacatgcag 
agcccgtcag 
atcagagcag 
aaggagaaaa 
aatttttgtt 
aaatcaaaag 
ctattaaaga 
ccactacgtg 
aatcggaacc 
gcgagaaagg 
gtcacgctgc 
cattcgccat 
ttacgccagc 
ttttcccagt 
ggcgaattaa 



cttcctcgct 
actcaaaggc 
gagcaaaagg 
ataggctccg 
acccgacagg 
ctgttccgac 
cgctttctca 
tgggctgtgt 
gtcttgagtc 
ggattagcag 
acggctacac 
gaaaaagagt 
ttgtttgcaa 
tttctacggg 
gattatcaaa 
tctaaagtat 
ctatctcagc 
taactacgat 
cacgctcacc 
gaagtggtcc 
gagtaagtag 
tggtgtcacg 
gagttacatg 
ttgtcagaag 
ctcttactgt 
cattctgaga 
ataccgcgcc 
gaaaactctc 
ccaactgatc 
ggcaaaatgc 
tcctttttca 
ttgaatgtat 
cacctgacgt 
cgaggcccct 
ctcccggaga 
ggcgcgtcag 
attgtactga 
taccgcatca 
aaatcagctc 
aatagaccga 
acgtggactc 
aaccatcacc 
ctaaagggag 
aagggaagaa 
gcgtaaccac 
tcaggctgcg 
tggcgaaagg 
cacgacgttg 
ttcgggg 



cactgactcg 
ggtaatacgg 
ccagcaaaag 
cccccctgac 
actataaaga 
cctgccgctt 
tagctcacgc 
gcacgaaccc 
caacccggta 
agcgaggtat 
tagaaggaca 
tggtagctct 
gcagcagatt 
gtctgacgct 
aaggatcttc 
atatgagtaa 
gatctgtcta 
acgggagggc 
ggctccagat 
tgcaacttta 
ttcgccagtt 
ctcgtcgttt 
atcccccatg 
taagttggcc 
catgccatcc 
atagtgtatg 
acatagcaga 
aaggatctta 
ttcagcatct 
cgcaaaaaag 
atattattga 
ttagaaaaat 
ctaagaaacc 
ttcgtctcgc 
cggtcacagc 
cgggtgttgg 
gagtgcacca 
ggaaattgta 
attttttaac 
gatagggttg 
caacgtcaaa 
ctaatcaagt 
cccccgattt 
agcgaaagga 
cacacccgcc 
caactgttgg 
gggatgtgct 
taaaacgacg 



4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7607 



<210> 35 
<211> 11600 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: plasmid MMTV-E2a.SV40.Neo 

<400> 35 
gagtgtccta cacctagggg agaagcagcc aaggggttgt ttcccaccaa ggacgacccg 180 
tctgcgcaca aacggatgag cccatcagac aaagacatat tcattctctg ctgcaaactt 240 
ggcatagctc tgctttgcct ggggctattg ggggaagttg cggttcgtgc tcgcagggct 300 
ctcacccttg actcttttaa tagctcttct gtgcaagatt acaatctaaa caattcggag 360 
aactcgacct tcctcctgag gcaaggacca cagccaactt cctcttacaa gccgcatcga 420 
ttttgtcctt cagaaataga aataagaatg cttgctaaaa attatatttt taccaataag 480 
accaatccaa taggtagatt attagttact atgttaagaa atgaatcatt atcttttagt 540 
actattttta ctcaaattca gaagttagaa atgggaatag aaaatagaaa gagacgctca 600 
acctcaattg aagaacaggt gcaaggacta ttgaccacag gcctagaagt aaaaaaggga 660 
aaaaagagtg tttttgtcaa aataggagac aggtggtggc aaccagggac ttatagggga 720 
ccttacatct acagaccaac agatgccccc ttaccatata caggaagata tgacttaaat 780 
tgggataggt gggttacagt caatggctat aaagtgttat atagatccct cccttttcgt 840 
gaaagactcg ccagagctag acctccttgg tgtatgttgt ctcaagaaga aaaagacgac 900 
atgaaacaac aggtacatga ttatatttat ctaggaacag gaatgcactt ttggggaaag 960 
attttccata ccaaggaggg gacagtggct ggactaatag aacattattc tgcaaaaact 1020 
catggcatga gttattatga atagccttta ttggcccaac cttgcggttc ccagggctta 1080 
agtaagtttt tggttacaaa ctgttcttaa aacgaggatg tgagacaagt ggtttcctga 1140 
cttggtttgg tatcaaaggt tctgatctga gctctgagtg ttctattttc ctatgttctt 1200 
ttggaattta tccaaatctt atgtaaatgc ttatgtaaac caagatataa aagagtgctg 1260 
attttttgag taaacttgca acagtcctaa cattcacctc ttgtgtgttt gtgtctgttc 1320 
gccatcccgt ctccgctcgt cacttatcct tcactttcca gagggtcccc ccgcagaccc 1380 
cggcgaccct caggtcggcc gactgcggca gctggcgccc gaacagggac cctcggataa 1440 
gtgacccttg tctctatttc tactatttgg tgtttgtctt gtattgtctc tttcttgtct 1500 
ggctatcatc acaagagcgg aacggactca ccatagggac caagctagcg cttctcgtcg 1560 
cgtccaagac cctcaaagat ttttggcact tcgttgagcg aggcgatatc aggtatgaca 1620 
gcgccctgcc gcaaggccag ctgcttgtcc gctcggctgc ggttggcacg gcaggatagg 1680 
ggtatcttgc agttttggaa aaagatgtga taggtggcaa gcacctctgg cacggcaaat 1740 
acggggtaga agttgaggcg cgggttgggc tcgcatgtgc cgttttcttg gcgtttgggg 1800 
ggtacgcgcg gtgagaatag gtggcgttcg taggcaaggc tgacatccgc tatggcgagg 1860 
ggcacatcgc tgcgctcttg caacgcgtcg cagataatgg cgcactggcg ctgcagatgc 1920 
ttcaacagca cgtcgtctcc cacatctagg tagtcgccat gcctttcgtc cccccgcccg 1980 
acttgttcct cgtttgcctc tgcgttgtcc tggtcttgct ttttatcctc tgttggtact 2040 
gagcggtcct cgtcgtcttc gcttacaaaa cctgggtcct gctcgataat cacttcctcc 2100 
tcctcaagcg ggggtgcctc gacggggaag gtggtaggcg cgttggcggc atcggtggag 2160 
gcggtggtgg cgaactcaga gggggcggtt aggctgtcct tcttctcgac tgactccatg 2220 
atctttttct gcctatagga gaaggaaatg gccagtcggg aagaggagca gcgcgaaacc 2280 
acccccgagc gcggacgcgg tgcggcgcga cgtcccccaa ccatggagga cgtgtcgtcc 2340 
ccgtccccgt cgccgccgcc tccccgggcg cccccaaaaa agcggatgag gcggcgtatc 2400 
gagtccgagg acgaggaaga ctcatcacaa gacgcgctgg tgccgcgcac acccagcccg 2460 
cggccatcga cctcggcggc ggatttggcc attgcgccca agaagaaaaa gaagcgccct 2520 
tctcccaagc ccgagcgccc gccatcacca gaggtaatcg tggacagcga ggaagaaaga 2580 
gaagatgtgg cgctacaaat ggtgggtttc agcaacccac cggtgctaat caagcatggc 2640 
aaaggaggta agcgcacagt gcggcggctg aatgaagacg acccagtggc gcgtggtatg 2700 
cggacgcaag aggaagagga agagcccagc gaagcggaaa gtgaaattac ggtgatgaac 2760 
ccgctgagtg tgccgatcgt gtctgcgtgg gagaagggca tggaggctgc gcgcgcgctg 2820 
atggacaagt accacgtgga taacgatcta aaggcgaact tcaaactact gcctgaccaa 2880 
gtggaagctc tggcggccgt atgcaagacc tggctgaacg aggagcaccg cgggttgcag 2940 
ctgaccttca ccagcaacaa gacctttgtg acgatgatgg ggcgattcct gcaggcgtac 3000 
ctgcagtcgt ttgcagaggt gacctacaag catcacgagc ccacgggctg cgcgttgtgg 3060 
ctgcaccgct gcgctgagat cgaaggcgag cttaagtgtc tacacggaag cattatgata 3120 
aataaggagc acgtgattga aatggatgtg acgagcgaaa acgggcagcg cgcgctgaag 3180 
gagcagtcta gcaaggccaa gatcgtgaag aaccggtggg gccgaaatgt ggtgcagatc 3240 
tccaacaccg acgcaaggtg ctgcgtgcac gacgcggcct gtccggccaa tcagttttcc 3300 
ggcaagtctt gcggcatgtt cttctctgaa ggcgcaaagg ctcaggtggc ttttaagcag 3360 
atcaaggctt ttatgcaggc gctgtatcct aacgcccaga ccgggcacgg tcaccttttg 3420 
atgccactac ggtgcgagtg caactcaaag cctgggcacg cgcccttttt gggaaggcag 3480 
ctaccaaagt tgactccgtt cgccctgagc aacgcggagg acctggacgc ggatctgatc 3540 
tccgacaaga gcgtgctggc cagcgtgcac cacccggcgc tgatagtgtt ccagtgctgc 3600 
aaccctgtgt atcgcaactc gcgcgcgcag ggcggaggcc ccaactgcga cttcaagata 3660 
tcggcgcccg acctgctaaa cgcgttggtg atggtgcgca gcctgtggag tgaaaacttc 3720 
accgagctgc cgcggatggt tgtgcctgag tttaagtgga gcactaaaca ccagtatcgc 3780 
aacgtgtccc tgccagtggc gcatagcgat gcgcggcaga acccctttga tttttaaacg 3840 
gcgcagacgg caagggtggg ggtaaataat cacccgagag tgtacaaata aaagcatttg 3900 
cctttattga aagtgtctct agtacattat ttttacatgt ttttcaagtg acaaaaagaa 3960 
gtggcgctcc taatctgcgc actgtggctg cggaagtagg gcgagtggcg ctccaggaag 4020 
ctgtagagct gttcctggtt gcgacgcagg gtgggctgta cctggggact gttgagcatg 4080 
gagttgggta ccccggtaat aaggttcatg gtggggttgt gatccatggg agtttggggc 4140 
cagttggcaa aggcgtggag aaacatgcag cagaatagtc cacaggcggc cgagttgggc 4200 
ccctgtacgc tttgggtgga cttttccagc gttatacagc ggtcggggga agaagcaatg 4260 
gcgctacggc gcaggagtga ctcgtactca aactggtaaa cctgcttgag tcgctggtca 4320 
gaaaagccaa agggctcaaa gaggtagcat gtttttgagt gcgggttcca ggcaaaggcc 4380 
atccagtgta cgcccccagt ctcgcgaccg gccgtattga ctatggcgca ggcgagcttg 4440 
tgtggagaaa caaagcctgg aaagcgcttg tcataggtgc ccaaaaaata tggcccacaa 4500 
ccaagatctt tgacaatggc tttcagttcc tgctcactgg agcccatggc ggcagctgtt 4560 
gttgatgttg cttgcttctt tatgttgtgg cgttgccggc cgagaagggc gtgcgcaggt 4620 
acacggtttc gatgacgccg cggtgcggcc ggtgcacacg gaccacgtca aagacttcaa 4680 
acaaaacata aagaagggtg ggctcgtcca tgggatccat atatagggcc cgggttataa 4740 
ttacctcagg tcgacctcga gggatctttg tgaaggaacc ttacttctgt ggtgtgacat 4800 
aattggacaa actacctaca gagatttaaa gctctaaggt aaatataaaa tttttaagtg 4860 
tataatgtgt taaactactg attctaattg tttgtgtatt ttagattcca acctatggaa 4920 
ctgatgaatg ggagcagtgg tggaatgcct ttaatgagga aaacctgttt tgctcagaag 4980 
aaatgccatc tagtgatgat gaggctactg ctgactctca acattctact cctccaaaaa 5040 
agaagagaaa ggtagaagac cccaaggact ttccttcaga attgctaagt tttttgagtc 5100 
atgctgtgtt tagtaataga actcttgctt gctttgctat ttacaccaca aaggaaaaag 5160 
ctgcactgct atacaagaaa attatggaaa aatattctgt aacctttata agtaggcata 5220 
acagttataa tcataacata ctgttttttc ttactccaca caggcataga gtgtctgcta 5280 
ttaataacta tgctcaaaaa ttgtgtacct ttagcttttt aatttgtaaa ggggttaata 5340 
aggaatattt gatgtatagt gccttgacta gagatcataa tcagccatac cacatttgta 5400 
gaggttttac ttgctttaaa aaacctccca cacctccccc tgaacctgaa acataaaatg 5460 
aatgcaattg ttgttgttaa cttgtttatt gcagcttata atggttacaa ataaagcaat 5520 
agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg tggtttgtcc 5580 
aaactcatca atgtatctta tcatgtctgg atccggctgt ggaatgtgtg tcagttaggg 5640 
tgtggaaagt ccccaggctc cccagcaggc agaagtatgc aaagcatgca tctcaattag 5700 
tcagcaacca ggtgtggaaa gtccccaggc tccccagcag gcagaagtat gcaaagcatg 5760 
catctcaatt agtcagcaac catagtcccg cccctaactc cgcccatccc gcccctaact 5820 
ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat ttatgcagag 5880 
gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt ttttggaggc 5940 
ctaggctttt gcaaaaagct tcacgctgcc gcaagcactc agggcgcaag ggctgctaaa 6000 
ggaagcggaa cacgtagaaa gccagtccgc agaaacggtg ctgaccccgg atgaatgtca 6060 
gctactgggc tatctggaca agggaaaacg caagcgcaaa gagaaagcag gtagcttgca 6120 
gtgggcttac atggcgatag ctagactggg cggttttatg gacagcaagc gaaccggaat 6180 
tgccagctgg ggcgccctct ggtaaggttg ggaagccctg caaagtaaac tggatggctt 6240 
tcttgccgcc aaggatctga tggcgcaggg gatcaagatc tgatcaagag acaggatgag 6300 
gatcgtttcg catgattgaa caagatggat tgcacgcagg ttctccggcc gcttgggtgg 6360 
agaggctatt cggctatgac tgggcacaac agacaatcgg ctgctctgat gccgccgtgt 6420 
tccggctgtc agcgcagggg cgcccggttc tttttgtcaa gaccgacctg tccggtgccc 6480 
tgaatgaact gcaggacgag gcagcgcggc tatcgtggct ggccacgacg ggcgttcctt 6540 
gcgcagctgt gctcgacgtt gtcactgaag cgggaaggga ctggctgcta ttgggcgaag 6600 
tgccggggca ggatctcctg tcatctcacc ttgctcctgc cgagaaagta tccatcatgg 6660 
ctgatgcaat gcggcggctg catacgcttg atccggctac ctgcccattc gaccaccaag 6720 
cgaaacatcg catcgagcga gcacgtactc ggatggaagc cggtcttgtc gatcaggatg 6780 
atctggacga agagcatcag gggctcgcgc cagccgaact gttcgccagg ctcaaggcgc 6840 
gcatgcccga cggcgaggat ctcgtcgtga cccatggcga tgcctgcttg ccgaatatca 6900 
tggtggaaaa tggccgcttt tctggattca tcgactgtgg ccggctgggt gtggcggacc 6960 
gctatcagga catagcgttg gctacccgtg atattgctga agagcttggc ggcgaatggg 7020 
ctgaccgctt cctcgtgctt tacggtatcg ccgctcccga ttcgcagcgc atcgccttct 7080 
atcgccttct tgacgagttc ttctgagcgg gactctgggg ttcgaaatga ccgaccaagc 7140 
gacgcccaac ctgccatcac gagatttcga ttccaccgcc gccttctatg aaaggttggg 7200 
cttcggaatc gttttccggg acgccggctg gatgatcctc cagcgcgggg atctcatgct 7260 
ggagttcttc gcccaccccg ggctcgatcc cctcgcgagt tggttcagct gctgcctgag 7320 
gctggacgac ctcgcggagt tctaccggca gtgcaaatcc gtcggcatcc aggaaaccag 7380 
cagcggctat ccgcgcatcc atgcccccga actgcaggag tggggaggca cgatggccgc 7440 
tttggtcccg gatctttgtg aaggaacctt acttctgtgg tgtgacataa ttggacaaac 7500 
tacctacaga gatttaaagc tctaaggtaa atataaaatt tttaagtgta taatgtgtta 7560 
aactactgat tctaattgtt tgtgtatttt agattccaac ctatggaact gatgaatggg 7620 
agcagtggtg gaatgccttt aatgaggaaa acctgttttg ctcagaagaa atgccatcta 7680 
gtgatgatga ggctactgct gactctcaac attctactcc tccaaaaaag aagagaaagg 7740 
tagaagaccc caaggacttt ccttcagaat tgctaagttt tttgagtcat gctgtgttta 7800 
gtaatagaac tcttgcttgc tttgctattt acaccacaaa ggaaaaagct gcactgctat 7860 
acaagaaaat tatggaaaaa tattctgtaa cctttataag taggcataac agttataatc 7920 
ataacatact gttttttctt actccacaca ggcatagagt gtctgctatt aataactatg 7980 
ctcaaaaatt gtgtaccttt agctttttaa tttgtaaagg ggttaataag gaatatttga 8040 
tgtatagtgc cttgactaga gatcataatc agccatacca catttgtaga ggttttactt 8100 
gctttaaaaa acctcccaca cctccccctg aacctgaaac ataaaatgaa tgcaattgtt 8160 
gttgttaact tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat 8220 
ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat 8280 
gtatcttatc atgtctggat ccccaggaag ctcctctgtg tcctcataaa ccctaacctc 8340 
ctctacttga gaggacattc caatcatagg ctgcccatcc accctctgtg tcctcctgtt 8400 
aattaggtca cttaacaaaa aggaaattgg gtaggggttt ttcacagacc gctttctaag 8460 
ggtaatttta aaatatctgg gaagtccctt ccactgctgt gttccagaag tgttggtaaa 8520 
cagcccacaa atgtcaacag cagaaacata caagctgtca gctttgcaca agggcccaac 8580 
accctgctca tcaagaagca ctgtggttgc tgtgttagta atgtgcaaaa caggaggcac 8640 
attttcccca cctgtgtagg ttccaaaata tctagtgttt tcatttttac ttggatcagg 8700 
aacccagcac tccactggat aagcattatc cttatccaaa acagccttgt ggtcagtgtt 8760 
catctgctga ctgtcaactg tagcattttt tggggttaca gtttgagcag gatatttggt 8820 
cctgtagttt gctaacacac cctgcagctc caaaggttcc ccaccaacag caaaaaaatg 8880 
aaaatttgac ccttgaatgg gttttccagc accattttca tgagtttttt gtgtccctga 8940 
atgcaagttt aacatagcag ttaccccaat aacctcagtt ttaacagtaa cagcttccca 9000 
catcaaaata tttccacagg ttaagtcctc atttaaatta ggcaaaggaa ttcttgaaga 9060 
cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct 9120 
tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc 9180 
taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa 9240 
tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt 9300 
gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct 9360 
gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc 9420 
cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta 9480 
tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac 9540 
tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc 9600 
atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac 9660 
ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg 9720 
gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac 9780 
gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc 9840 
gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt 9900 
gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga 9960 
gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc 10020 
cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag 10080 
atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca 10140 
tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc 10200 
ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca 10260 
gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc 10320 
tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta 10380 
ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt 10440 
ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc 10500 
gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg 10560 
ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg 10620 
tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag 10680 
ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc 10740 
agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat 10800 
agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg 10860 
gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc 10920 
tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt 10980 
accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca 11040 
gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt 11100 
atttcacacc gcatatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc 11160 
cagtatctgc tccctgcttg tgtgttggag gtcgctgagt agtgcgcgag caaaatttaa 11220 
gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga atctgcttag ggttaggcgt 11280 
tttgcgctgc ttcgcgatgt acgggccaga tatacgcgta tctgagggga ctagggtgtg 11340 
tttaggcgaa aagcggggct tcggttgtac gcggttagga gtcccctcag gatatagtag 11400 
tttcgctttt gcatagggag ggggaaatgt agtcttatgc aatacacttg tagtcttgca 11460 
acatggtaac gatgagttag caacatgcct tacaaggaga gaaaaagcac cgtgcatgcc 11520 
gattggtgga agtaaggtgg tacgatcgtg ccttattagg aaggcaacag acgggtctga 11580 
catggattgg acgaaccact 11600 

<210> 36 
<211> 53 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 36 

gtcactcgag gactcggtcg actgaaaatg agacatatta tctgccacgg acc 53 



<210> 37 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 37 

cgagatcgat cacctccggt acaaggtttg gcatag 

<210> 38 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 38 

catgaagatc tggaaggtgc tgaggtacga tgagacc 

<210> 39 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 39 

gcgacttaag cagtcagctg agacagcaag acacttgctt gatccaaatc c 

<210> 40 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 40 

cacgaattcg tcagcgcttc tcgtcgcgtc caagaccc 38 
<210> 41 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 41 

caccccgggg aggcggcggc gacggggacg gg 

<210> 42 
<211> 7231 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: plasmid pDV80 



<400> 42 

ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg cgagcaaaat ttaagctaca 60 
acaaggcaag gcttgaccga caattgcatg aagaatctgc ttagggttag gcgttttgcg 120 
ctgcttcgcg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 180 
tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 240 
cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 300 
atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac 360 
tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 420 
cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 480 
tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg 540 
cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt 600 
ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca 660 
aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag 720 
gtctatataa gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa 780 
attaatacga ctcactatag ggagacccaa gctggctagc gtttaaactt aagcttggta 840 
ccgagctcgg atccactctc ttccgcatcg ctgtctgcga gggccagctg ttggggtgag 900 
tactccctct gaaaagcggg catgacttct gcgctaagat tgtcagtttc caaaaacgag 960 
gaggatttga tattcacctg gcccgcggtg atgcctttga gggtggccgc atccatctgg 1020 
tcagaaaaga caatcttttt gttgtcaagc ttggtggcaa acgacccgta gagggcgttg 1080 
gacagcaact tggcgatgga gcgcagggtt tggtttttgt cgcgatcggc gcgctccttg 1140 
gccgcgatgt ttagctgcac gtattcgcgc gcaacgcacc gccattcggg aaagacggtg 1200 
gtgcgctcgt cgggcaccag gtgcacgcgc caaccgcggt tgtgcagggt gacaaggtca 1260 
acgctggtgg ctacctctcc gcgtaggcgc tcgttggtcc agcagaggcg gccgcccttg 1320 
cgcgagcaga atggcggtag ggggtctagc tgcgtctcgt ccggggggtc tgcgtccacg 1380 
gtaaagaccc cgggcagcag gcgcgcgtcg aagtagtcta tcttgcatcc ttgcaagtct 1440 
agcgcctgct gccatgcgcg ggcggcaagc gcgcgctcgt atgggttgag tgggggaccc 1500 
catggcatgg ggtgggtgag cgcggaggcg tacatgccgc aaatgtcgta aacgtagagg 1560 
ggctctctga gtattccaag atatgtaggg tagcatcttc caccgcggat gctggcgcgc 1620 
acgtaatcgt atagttcgtg cgagggagcg aggaggtcgg gaccgaggtt gctacgggcg 1680 
ggctgctctg ctcggaagac tatctgcctg aagatggcat gtgagttgga tgatatggtt 1740 
ggacgctgga agacgttgaa gctggcgtct gtgagaccta ccgcgtcacg cacgaaggag 1800 
gcgtaggagt cgcgcagctt gttgaccagc tcggcggtga cctgcacgtc tagggcgcag 1860 
tagtccaggg tttccttgat gatgtcatac ttatcctgtc cctttttttt ccacagctcg 1920 
cggttgagga caaactcttc gcggtctttc cagtactctt ggatcggaaa cccgtcggcc 1980 
tccgaacgag atccgtactc cgccgccgag ggacctgagc gagtccgcat cgaccggatc 2040 
ggaaaacctc tcgagaaagg cgtctaacca gtcacagtcg caagatccaa gatgaagcgc 2100 
gcccgcccca gcgaagatga cttcaacccc gtctacccct atggctacgc gcggaatcag 2160 
aatatcccct tcctcactcc cccctttgtc tcctccgatg gattcaaaaa cttcccccct 2220 
ggggtactgt cactcaaact ggctgatcca atcaccatta ccaatgggga tgtatccctc 2280 
aaggtgggag gtggtctcac tttgcaagat ggaagcctaa ctgtaaaccc taaggctcca 2340 
ctgcaagtta atactgataa aaaacttgag cttgcatatg ataatccatt tgaaagtagt 2400 
gctaataaac ttagtttaaa agtaggacat ggattaaaag tattagatga aaaaagtgct 2460 
gcggggttaa aagatttaat tggcaaactt gtggttttaa caggaaaagg aataggcact 2520 
gaaaatttag aaaatacaga tggtagcagc agaggaattg gtataaatgt aagagcaaga 2580 
gaagggttga catttgacaa tgatggatac ttggtagcat ggaacccaaa gtatgacacg 2640 
cgcacacttt ggacaacacc agacacatct ccaaactgca caattgctca agataaggac 2700 
tctaaactca ctttggtact tacaaagtgt ggaagtcaaa tattagctaa tgtgtctttg 2760 
attgtggtcg caggaaagta ccacatcata aataataaga caaatccaaa aataaaaagt 2820 
tttactatta aactgctatt taataagaac ggagtgcttt tagacaactc aaatcttgga 2880 
aaagcttatt ggaactttag aagtggaaat tccaatgttt cgacagctta tgaaaaagca 2940 
attggtttta tgcctaattt ggtagcgtat ccaaaaccca gtaattctaa aaaatatgca 3000 
agagacatag tttatggaac tatatatctt ggtggaaaac ctgatcagcc agcagtcatt 3060 
aaaactacct ttaaccaaga aactggatgt gaatactcta tcacatttaa ctttagttgg 3120 
tccaaaacct atgaaaatgt tgaatttgaa accacctctt ttaccttctc ctatattgcc 3180 
caagaatgaa agagcggccg ctcgagtcta gagggcccgt ttaaacccgc tgatcagcct 3240 
cgactgtgcc ttctagttgc cagccatctg ttgtttgccc ctcccccgtg ccttccttga 3300 
ccctggaagg tgccactccc actgtccttt cctaataaaa tgaggaaatt gcatcgcatt 3360 
gtctgagtag gtgtcattct attctggggg gtggggtggg gcaggacagc aagggggagg 3420 
attgggaaga caatagcagg catgctgggg atgcggtggg ctctatggct tctgaggcgg 3480 
aaagaaccag ctggggctct agggggtatc cccacgcgcc ctgtagcggc gcattaagcg 3540 
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg 3600 
ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 3660 
taaatcgggg catcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 3720 
aacttgatta gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 3780 
ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 3840 
tcaaccctat ctcggtctat tcttttgatt tataagggat tttggggatt tcggcctatt 3900 
ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttaattctgt ggaatgtgtg 3960 
tcagttaggg tgtggaaagt ccccaggctc cccaggcagg cagaagtatg caaagcatgc 4020 
atctcaatta gtcagcaacc aggtgtggaa agtccccagg ctccccagca ggcagaagta 4080 
tgcaaagcat gcatctcaat tagtcagcaa ccatagtccc gcccctaact ccgcccatcc 4140 
cgcccctaac tccgcccagt tccgcccatt ctccgcccca tggctgacta atttttttta 4200 
tttatgcaga ggccgaggcc gcctctgcct ctgagctatt ccagaagtag tgaggaggct 4260 
tttttggagg cctaggcttt tgcaaaaagc tcccgggagc ttgtatatcc attttcggat 4320 
ctgatcagca cgtgttgaca attaatcatc ggcatagtat atcggcatag tataatacga 4380 
caaggtgagg aactaaacca tggccaagtt gaccagtgcc gttccggtgc tcaccgcgcg 4440 
cgacgtcgcc ggagcggtcg agttctggac cgaccggctc gggttctccc gggacttcgt 4500 
ggaggacgac ttcgccggtg tggtccggga cgacgtgacc ctgttcatca gcgcggtcca 4560 
ggaccaggtg gtgccggaca acaccctggc ctgggtgtgg gtgcgcggcc tggacgagct 4620 
gtacgccgag tggtcggagg tcgtgtccac gaacttccgg gacgcctccg ggccggccat 4680 
gaccgagatc ggcgagcagc cgtgggggcg ggagttcgcc ctgcgcgacc cggccggcaa 4740 
ctgcgtgcac ttcgtggccg aggagcagga ctgacacgtg ctacgagatt tcgattccac 4800 
cgccgccttc tatgaaaggt tgggcttcgg aatcgttttc cgggacgccg gctggatgat 4860 
cctccagcgc ggggatctca tgctggagtt cttcgcccac cccaacttgt ttattgcagc 4920 
ttataatggt tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc 4980 
actgcattct agttgtggtt tgtccaaact catcaatgta tcttatcatg tctgtatacc 5040 
gtcgacctct agctagagct tggcgtaatc atggtcatag ctgtttcctg tgtgaaattg 5100 
ttatccgctc acaattccac acaacatacg agccggaagc ataaagtgta aagcctgggg 5160 
tgcctaatga gtgagctaac tcacattaat tgcgttgcgc tcactgcccg ctttccagtc 5220 
gggaaacctg tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt 5280 
gcgtattggg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct 5340 
gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga 5400 
taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc 5460 
cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg 5520 
ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg 5580 
aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt 5640 
tctcccttcg ggaagcgtgg cgctttctca atgctcacgc tgtaggtatc tcagttcggt 5700 
gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg 5760 
cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact 5820 
ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt 5880 
cttgaagtgg tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct 5940 
gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac 6000 
cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc 6060 
tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg 6120 
ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta 6180 
aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca 6240 
atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc 6300 
ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc 6360 
tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc 6420 
agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat 6480 
taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt 6540 
tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc 6600 
cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag 6660 
ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt 6720 
tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac 6780 
tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg 6840 
cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat 6900 
tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc 6960 
gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc 7020 
tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa 7080 
atgttgaata ctcatactct tcctttttca atattattga agcatttatc agggttattg 7140 
tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg 7200 
cacatttccc cgaaaagtgc cacctgacgt 7231 



<210> 43 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: primer 
<400> 43 

tgtcttggat ccaagatgaa gcgcgcccgc cccagcgaag atgacttc 
<210> 44 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: primer 

<400> 44 

aaacacggcg gccgctcttt cattcttg 28 

<210> 45 
<211> 17 
<212> PRT 

<213> Ad37 N-terminus 

<400> 45 

Met Ser Lys Arg Leu Arg Val Glu Asp Asp Phe Asn Pro Val Tyr Pro Tyr 
1 5 10 15 

<210> 46 
<211> 6 
<212> PRT 

<213> artificial sequence 
<400> 46 

Lys Arg Ala Arg Pro Ser 
1 5 



<210> 47 
<211> 17 
<212> PRT 

<213> Ad5 modified N-terminus 

<400> 47 

Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro Tyr 
1 5 10 15 

<210> 48 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 48 

ggatccatgg gatacttggt agca 24 

<210> 49 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 49 

gcaactcgag tcattcttgg gcaatatagg 30 

<210> 50 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 50 

cgcgctgact cttaaggact agtttc 26 

<210> 51 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 51 

gcgcttaatt aacatcatca ataatatacc ttatttt 37 



